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fn (54) Title: NOVEL GENETIC MARKERS FOR LEUKEMIAS 

(57) Abstract: The present invention is related to methods for detecting leukemia cells by determine the expression profile of a group 
of markers. In particular, the type or subtype of leukemia cells in a sample is determined. Further, uses of the group of markers is 
}^ discloses and compositions comprising these markers. 
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Novel Genetic Markers for Leukemias 

The present invention is related to methods for detecting leukemia cells by 
determing the expression profile of a group of markers. In particular, the type or 
subtype of leukemia cells in a sample is determined. Further, uses of the group of 
markers are disclosed and compositions comprising these markers. 

5 In the present specification, a number of documents is cited. The disclosure 
content of these documents including manufacturers' manuals, is herewith 
incorporated by reference. This holds particular true for the documents such as 
gene accession numbers cited in Tables 43a, b, 44 and 45 providing the complete 
nucleotide sequence of marker genes/cDNAs. In other terms, by reciting these 
10 documents, applicant intends to incorporate the complete nucleotide/amino acid 
sequence of those markers where only a partial sequence has been identified in 
the appended Tables. It is also intended to include the (poly)peptide sequences 
translated from these nucleotide sequences within the disclosure content of the 
present specification. 

15 Today leukemias are classified into four different groups or types: acute myeloid 
(AML), acute lymphatic (ALL), chronic myeloid (CML) and chronic lymphatic 
leukemia (CLL). Within these groups, several subcategories can be identified 
further using a panel of standard techniques as described below. The incidence of 
leukemias is increasing with age and is 5/100.000/year in AML, 1/100.000/year in 

20 ALL, 1/100.000 in CML and 6/100.000/year in CLL. Several methods for 
classification have to be applied at diagnosis and before treatment starts: 
cytomorphology and cytochemistry, multiparameter -immunophenotyping, 
cytogenetics including fluorescence in situ hybridization, and molecular techniques 
such as polymerase chain reaction (PCR). So far only a combination of these 

25 techniques allows a precise diagnosis which is necessary to apply state of the art 
treatment. As the exact diagnosis is mandatory for example in CML the detection 
of a specific cytogenetic abnormality, the translocation (9;22) or its molecular 
counterpart, the BCR/ABL rearrangement is required to establish the diagnosis of 
CML. While all patients with CML show a BCR-ABL-rearrangement and are 

30 therefore homogenous with regard to the primary genetic abnormality, in AML and 
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ALL at least 10-15 different subgroups have been identified on the morphological, 
genetical or molecular level. Also in CLL several subgroups can be clearly 
separated. These different subcatgories in leukemias are associated with varying 
clinical outcome and therefore are the basis for different treatment strategies. The 
5 importance of highly specific classification may be illustrated in detail further for 
the AML as a very heterogeneous group of diseases. 

Data from clinical trials showed that outcome of patients with AML differs in a 
broad range. Several parameters influencing prognosis have been identified. 
These can be assigned to different categories: patients' characteristics (i.e. age, 

10 comorbidity), therapy, and biology of the AML. Therefore, a lot of effort was 
invested to identify biological entities and to distinguish subgroups of AML which 
are associated with a favorable, intermediate or unfavorable prognosis, 
respectively. In order to allow a comparison between different studies a 
classification of AML was mandatory. In 1 976 the FAB classification was proposed 

15 by the French-American-British co-operative group which was based on 
cytomorphology and cytochemistry in order to separate AML subgroups according 
to the morphological appearance of blasts in the! blood and bone marrow. In 
addition, it was recognized that genetic abnormalities occurring in the leukemic 
blast had a major impact on the morphological picture and even more on the 

20 prognosis. So far, the karyotype of the leukemic blasts is the most important 
independent prognostic factor regarding response to therapy as well as survival. 
For clinical purposes karyotype analysis allows to discriminate between three 
major prognostic groups. A favorable outcome under currently used treatment 
regimens with cure rates from 50 % up to 858 was observed in several studies in 

25 patients with a) t (8;21) (q22; q22) occuring in AML M2, b) inv (16) (p13q22) 
occurring in; AML M4eo and c) t(15;17) (q22; qll-12) occurring in AML M3/H3v. In 
contrast, chromosome aberrations with an unfavorable clinical course are - 
5/del(5q), -7/de1(7q), inv(3)/t(3:31 and complex aberrant karyotypes with cure 
rates of only 10%. The remainder of AML patients are assigned to a prognosticate 

30 intermediate group. This latter group is very heterogeneous because it includes 
patients with a normal karyotype as well as those with rare chromosome 
aberrations with yet unknown prognostic impact. 

The sub-classification of leukemias becomes increasingly important to guide 
therapy. Thus, the development of new, specific treatment approaches requires 
35 the identification of specific subtypes that may benefit from a distinct therapeutic 
protocol. It has already been shown in two entities that the development of specific 
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drugs can improve outcome of distinct subsets of leukemia. One important 
example is the development of a new therapeutic drug (STI571 ) for the treatment 
of chronic myeloid leukemia (ML): this designed molecule inhibits the CML specific 
chimeric tyrosine kinase - BCR=ABtr generated froirrthe genetic defect observed in 
5 CML, the BCR-ABL-rearrangement due to the translocation between 
chromosomes 3 and 22 (t(9;22) (q34; q1 1)). First data show that therapy response 
is dramatically higher in patients treated with this new drug as compared' to all 
other drugs that had been used so far. Another example is the subtype of acute 
myeloid leukemia AML M3 and its variant M3v both with karyotype t[15;17)(q22; 

10 q11-12). The introduction of a new drug (all-trans retinoic acid - ATRA) has 
improved the outcome in this subgroup of patient from about 50% to 85 % long- 
term survivors; As it is mandatory for these patients suffering from these specific 
leukemia subtypes to be identified as fast as possible so that the best therapy can 
be applied, diagnostics today must accomplish sub-classification with maximal 

1 5 precision. Not only for these subtypes but also for several other leukemia subtypes 
different treatment approaches could improve outcome. Therefore, rapid and 
precise identification of distinct leukemia subtypes is the future goal for 
diagnostics. 

So far a combination of methods is necessary to obtain the most important 

20 information in leukemia diagnostics: Analysis of the morphology and cytochemistry 
of bone marrow blasts and peripheral blood cells is necessary to establish the 
diagnosis. In some cases the addition of immunophenotyping is mandatory to 
separate very undifferentiated AML from acute lymphoblastic leukemia and CLL 
Leukemia subtypes investigated can be diagnosed by cytomorphology alone, only 

25 if an expert reviews the smears. However, a genetic analysis based on 
chromosome analysis, fluorescence in situ hybridization or RT-PCR and 
immunophenotyping is required in order to assign all cases in to the right category. 
The aim of these techniques besides diagnosis is mainly to determine the 
prognosis of the leukemia. A major disadvantage of these methods, however, is 

30 that viable cells are necessary as the cells for genetic analysis have to divide in 
vitro in order to obtain metaphases for the analysis. Another problem is the long 
time of 72 hours from receipt of the material in the laboratory to obtain the result. 
Furthermore, great experience in preparation of chromosomes and even more in 
analyzing the karyotypes is required to obtain the correct result in at least 90% of 

35 cases. These experts in their field are necessary for all other techniques 
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mentioned above as well. Accordingly, standard diagnosis of leukemia uses a 
combination of complementary methods, is expensive, time-consuming, and 
requires experienced experts in the field. Methods that have to be combined are 
cytomorphology or histomorphology, multiparameter-immunophenotyping, 
5 cytogenetics, fluorescence in situ hybridization, and molecular genetics such as 
polymerase chain reaction based assays. 

Using these techniques in combination, hematological malignancies in a first 
approach are separated into chronic myeloid leukemia (CML), chronic lymphoid 
(CLL), acute lymphoblastic (ALL), and acute myeloid leukemia (AML). Within the 

10 latter three disease entities several prognostically relevant subtypes have been 
established. As a second approach this further subclassification is based mainly 
on genetic abnormalities of the leukemic blasts and clearly is associated with 
different prognoses. Therefore, this subclassification is increasingly important to 
guide therapy. Furthermore, the development of new, specific treatment 

15 approaches requires precise identification of leukemia subtypes. 

In a first study Golub et al. (Science 1999) showed that gene expression profiles 
can be used for class prediction and discriminated AML from ALL samples. 
However, for his analysis of acute leukemias the selection of the two different 
subgroups was performed using exclusively morphologic-phenotypical criteria. 
20 This was only descriptive and does not provide deeper insights into the 
pathogenesis or the underlying biology of the leukemia. The approach reproduces 
only very basic knowledge of cytomorphology and intends to differentiate classes. 
The data is not sufficient to predict prognostically relevant cytogenetic aberrations. 

Thus, the technical problem underlying the present invention was to provide 
25 means for leukemia diagnostics which overcome the disadvantages of the prior art 
diagnostic methods. 

The solution to said technical problem is achieved by providing the embodiments 
characterized in the claims. Accordingly, the present invention relates to a method 
of determining whether a patient sample contains leukemia cells or other cells 
30 comprising the steps of a) determining the expression profile of a group of markers 
in a patient sample and b) concluding from the expression profile whether the 
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patient sample contains leukemia cells or other cells characterized in that the 
group of markers consists of markers selected independently from the markers 
listed in one or more of the tables 3 to 6, tables 15 to 20, tables 29, 30, 41, or 42 
and whereby the number of markers in the group- is between one and the total 
5 number of markers listed in the tables 3 to 6, tables 15 to 20, and tables 29, 30, 
41 , or 42. In a particular embodiment therof, the present invention pertains to a 
method wherein leukemia type and subtype are simultaneously determined 
whereby a microarray for the detection of the expression level of a marker or a 
group of markers is used. 

10 It is important to note that in accordance with the invention in all pertaining 
embodiments any possible combination of markers, said markers being disclosed 
in the respective table or tables is encompassed within the scope of the invention. 

As used herein, the term "expression" refers to the process by which mRNA or a 
polypeptide is produced based on the nucleic acid sequence of a gene. The 
15 process includes both transcription and translation, i.e. expression" shall also 
include the formation of mRNA upon transcription. 

In accordance with the present invention, the term determining the expression 
profile" preferably refers to the determination of the level of expression, namely of 
said group of markers. 

20 As used herein, the term „marker" refers to a DNA, in particular cDNA, or RNA or a 
fragment thereof or a protein or a fragment thereof which are in the case of RNA 
(or cDNA) formed upon transcription of a nucleotide sequence which is capable of 
expression. The nucleic acid molecule fragments refer to fragments preferably of 
at least 8 such as ten, twelfe, fifteen or eighteen nucleotides in length representing 

25 a consecutive stretch of nucleotides of a gene, cDNA or mRNA such as of 20 or 
25 nucleotides that are, for example, further specified in the appended Tables or a 
complementary sequence thereto. In other terms, markers include any fragment 
(or complementary sequence thereto) of the sequences depicted in the appended 
tables as long as these fragments unambiguously identify the marker. Typical 

30 fragment lengths are provided above. The determination of the expression profile 
of markers may be effected at the transcriptional or translational level. In other 
terms, the method of the invention envisages the determination at the level of 
mRNA or at the protein level. Protein fragments such as peptides advantageously 
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comprise at least 6 consecutive amino acids representative of the corresponding 
full length protein. 6 amino acids are generally recognized as the lowest peptidic 
stretch giving rise to a linear epitope recognized by an antibody, fragment or 
derivative thereof. Alternatively, the proteins or fragments thereof may be analysed 
5 using nucleic acid molecules specifically binding to three-dimensional structures 
(aptamers). In principle, the investigator may determine, in accordance with the 
method of the invention, whether a gene is expressed at all in a leukemic or other 
cell. Alternatively, an investigator may determine the difference in the expression 
level, for example, between a leukemic and a non-leukemic cell or between two or 

10 more different types or subtypes of leukemia. If the sample comprises only other, 
i.e. non-leukemia cells, then the patient's suffering from a leukaemia may safely be 
denied. Insofar, the above main embodiment is to be understood that if the 
presence of other cells is determined then this determination includes an 
assessment to the effect that only other cells but no leukemic cells are comprised 

15 in the sample. On the other hand, the determination of leukemic cells may include 
the further characterization of such cells including the differentiation status of the 
cells as well as the distinction from other types of cancer cells or other subtypes of 
leukaemia cells. Particular embodiments in this regard are further outlined herein 
below. 

20 

In accordance with the above, the present invention also contemplates methods 
where simply the assessment of leukaemia cells but not necessarily of other cells 
is effected. This holds true for all embodiments where the determination of other 
cells is mentioned. It is to be understood that with the exception of the possible 

25 determination of other cells, the steps of the various methods of the invention 
remain unchanged. Thus, the invention also relates to a method of determining 
whether a patient sample contains leukemia cells comprising the steps of a) 
determining the the expression profile of a group of markers in a patient sample 
and b) concluding from expression profile whether the patient sample contains 

30 leukemia cells characterized in that the group of markers consists of markers 
selected independently from the markers listed in one or more of the tables 3 to 6, 
tables 15 to 20, tables 29, 30, 41 , or 42 and whereby the number of markers in the 
group is between one and the total number of markers listed in the tables 3 to 6, 
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tables 15 to 20, and tables 29, 30, 41, or 42. Thus, the invention further relates to 
a method of determining whether a patient sample contains leukemia cells and at 
the same time or subsequently determining the type and subtype of leukemia 
cells, if leukemia cells are present, comprising the steps of a) determining the 
5 expression profile of a group of markers in a patient sample and b) concluding 
from the expression profile whether the patient sample contains leukemia cells and 
at the same time or subsequently determining the type and subtype of leukemia 
cells, if leukemia cells are present, characterized in that the group of markers 
consists of markers selected independently from the markers listed in one or more 
10 of the tables 16 to 20 or table 29 or 30 and whereby the number of markers in the 
group is between one and the total number of markers listed in the tables 16 to 20 
or table 29 or 30, to name two important embodiments of the invention. 

Determination of the expression profile/levels may be effected by a variety of 

15 methods, depending on the nature of the marker. Thus, if the marker is mRNA, 
cDNA may be prepared into which a detectable label, such as a fluorescent, 
chemiluminescent, bioluminescent, radioactive (such as 3 H or 32 P) label is 
incorporated. Said detectably labelled cDNA, in single-stranded form, may then be 
hybridised, preferably under stringent or highly stringent conditions to a panel of 

20 single-stranded oligonucleotides representing different genes and affixed to a solid 
support such as a chip. Upon applying appropriate washing steps, those cDNAs 
will be detected or quantitatively detected that have a counterpart in the 
oligonucleotide panel. Various advantageous embodiments of this general method 
are feasible. For example, the mRNA or the cDNA may be amplified wherein it is, 

25 for quantitative assessments, preferable that the number of amplified copies 
corresponds relative to further amplified mRNAs or cDNAs to the number of 
mRNAs originally present in the cell. Also, the cDNAs may be transcribed into 
cRNAs wherein only in the transcription step a label is incorporated into the 
nucleic acid and wherein the cRNA is employed for hybridisation. Alternatively, the 

30 lable may be attached subsequent to the transcription step. Similarly, proteins 
from a cell or tissue under investigation may be contacted with a panel of 
aptamers or of antibodies or fragments or derivatives thereof. The antibodies etc. 
may be affixed to a solid support such as a chip. Binding of proteins indicative of a 
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leukemia or a subtype of leukaemia may be verified by binding to a detectably 
labelled secondary antibody or aptamer. For the labelling of antibodies, it is 
referred to Harlow and Lane, "Antibodies, a laboratory manual", CSH Press, 1988, 
Cold Spring Harbor. As regards further test assays and formats, it is referred to 
5 further embodiments of the invention as specified herein below as well as to the 
appended examples. In addition, a number of applicable assay formats are 
available in the art that can applied to the method of the invention without further 
ado. Specifically, a minimum set of proteins necessary for diagnosis of all 
leukemia types may be selected for creation of a protein array system to make 

10 diagnosis on a protein lysate of a diagnostic bone marrow sample directly. Protein 
Array Systems for the detection of specific protein expression profiles already are 
available (for example: Bio-Plex, BIORAD, Munchen, Germany). For this 
'application "preferably" antibodies "against the proteins have to be produced and 
immobilized on a platform e.g. glasslides or microtiterplates. The immobilized 

15 antibodies can be labeled with a reactant specific for the certain target proteins as 
discussed above. The reactants can include enzyme substrates, DNA, receptors, 
antigens or antibodies to create for example a capture sandwich immunoassay. 

The level of the expression of the „marker" is indicative of a leukemic condition, of 
20 a cell or an organism. The level of expression of a marker or group of markers is 
measured and is compared with the level of expression of the same marker or the 
same group of markers from other cells or samples. The comparison may be 
effected in an actual experiment or in silico. When the expression level also 
referred to as expression pattern or expression signature (expression profile) is 
25 measurably different, there is according to the invention a meaningful difference in 
the level of expression. Preferably the difference at least is 5 %, 10% or 20%, 
more preferred at least 50% or may even be as high as 75% or 100%. More 
preferred the difference in the level of expression is at least 200%, i.e. two fold, at 
least 500%, i.e. five fold, or at least 1000%, i.e. 10 fold. 

30 The present invention allows to diagnose a wide variety and at least 14 different 
clinically relevant leukemia subtypes. Therefore, the invention of a combination of 
marker genes and their specific expression level it is possible to substitute all other 
mandatory diagnostic approaches including the approach of Golub and colleagues 
(cytomorphology or histomorphology, multiparameter-immunophenotyping, 
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cytogenetics, fluorescence in situ hybridization, and molecular genetics) in one 
single step with a specifity and sensitivity that had never been achieved in all other 
techniques used so far. 

In more detail, based on biomathematical analysis of gene expression profiles a 
5 new method could be provided which forms the basis for designing and devejoping 
a novel diagnostic approach preferably based on microarray technology. Further, 
subsets of markers, preferably genes could be introduced which allow the 
determination of leukemia type and subtype. The method according to the 
invention abolishes today's standard procedures in diagnosis of leukemia. These 

10 standard diagnostic procedures require more and more centralized core facilities 
with both personal experts in the fields of cytomorphology, cytogenetics and 
molecular genetics and expensive lab equipment, which causes increasing costs 
for adequate diagnosis. The present invention provides novel cost-effective 
methods and diagnostic tools, which are less time consuming, easy to operate but 

1 5 nevertheless as accurate and safe as all standard methods combined today. The 
genes or sets of genes allows to assign clinical samples either as healthy or 
malignant simply based on their gene expression profiles. The genes, 
representative fragments thereof or transcription or translation products thereof 
form the basis for the methods of the invention or diagnostic tools, corresponding 

20 thereto. Furthermore, these genes etc. allow to predict the diagnoses based on 
the genetic abnormality of the expression pattern and to discriminate between 
different prognostic relevant entities. When comparing two groups of microarray 
experiments, Golub's method (Science 286 (1999), 531-537) sorts the genes with 
respect to the signal-to-noise ratio of gene x: S x = (//i-//2)/(cri+a 2 ), where //k and <Jk 

25 denote the mean expression and standard deviation of gene x in group k. 

According to a specified number of "informative" genes the 20 best discriminating 
genes are selected. For each informative gene a decision limit is calculated as b x = 
(//1+A/2V2. To classify a new sample of an independent test set, the gene 
expression levels of informative genes are taken and for each gene x and sample 
30 y a so-called vote is calculated as V x = S x (g x y - b x ), where g x y denotes expression 
level of gene x in sample y. The votes of all informative genes are summed up 
("weighted voting") and depending upon the sign of this sum the new sample is 
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classified as group 1 or group. 2. The confidence in the prediction is calculated as 
|ZV x /I|Vx||. 

To assess the significance of each gene, a permutation test is performed, which 
determines signal-to-noise ratios when class labels are permuted randomly. 
5 To assess the robustness of the classifier, a leave-one-out crossvalidation is 
performed. Accuracy \s the rate of correctly classified test samples. 

The decision limit proposed by Golub does not provide optimal classification 
accuracy in all situations. When the standard deviation of expression levels within 
the two groups are very different, the decision limit is biased towards the group 
10 with the higher standard deviation. 

A decision limit for a particular gene can be considered optimal, if it achieves 
maximum classification accuracy for a given dataset. By determining 
systematically classification accuracies for a set of possible decision limits, an 
optimal decision limit can be calculated. The underlying statistics as described in 
15 Example 3 select an optimal decision limit from the following set of decision limits 

U = { (g x y + g x y - 1 )/2 1 1 < y <= n } 

where g x y denotes expression level of gene x in sample y, n denotes the total 
number of samples in the training set. 

20 Golubs method selects an arbitrary number of "informative" genes to discriminate 
between two classes of samples according to their signal-to-noise ratio, typically in 
the range of 10 to 50 genes. 

Choosing too many genes like in Golub's method carries the risk of overfitting, 
which causes poor generalization features of the model. 

25 Therefore the present invention applies an heuristic approach to select a minimal 
set of discriminative genes, which provides maximum classification accuracy in 
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leave-one-out-crossvalidation. I.e. for a given set of genes weighted voting as 
described by Golub is applied and the classification accuracy is calculated by 
crossvalidation used in accordance with the present invention and representing a 
further embodiment in accordance with this invention. 

i 

5 The method for achieving this used in accordance with the present invention and 
representing a further embodiment in accordance with this invention consists of 
the following steps: 

(a) calculating of the top 20 discriminating genes according to the signal-to- 
noise ratio (top 20 SNR's); 

10 - (b) calculating .classification accuracy_ and confidence based on optimal 
decision limits for each of the top 20 genes; 

(c) selecting the gene which provides best classification accuracy and 
confidence out of step 2; and 

(d) testing for each of the remaining 19 genes, whether adding this gene to the 
15 model improves accuracy and confidence. 

If the gene improves accuracy and confidence, it is added to the weighted voting 
model, otherwise it is discarded. 

Preferably, the decision limit is set according to the formula recited above. 

In a pilot study consisting of 103 Affymetrix Genechip microarrays with 12625 
20 genes each as shown in the appended examples we compared the results 
achieved with Golub's method and with our extended method. 

Table A presents an analysis of 1 8 samples class A versus 85 samples class non- 
A. Based on 20 informative genes Golub's method results in a crossvalidation 
accuracy of 0,87 (confidence 0,77); achieves with three genes out of the top 20 set 
25 a crossvalidation accuracy of 0,96 (confidence 0,88). 
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The same analysis was performed for one versus all (OVA) and all pairs (AP) 
comparisons in this dataset consisting of 5 different classes. Figure 13 b presents 
accuracy and confidence obtained by both methods: the method of the invention 
outperforms Golub's method clearly both in terms of accuracy and confidence of 
5 classifications. ■ 

The development of a leukemia diagnostic tool, preferably microarray based, 
allows for all patients which are preferably humans and specimens a reproducible, 
highly specific and rapid method to obtain important information for treatment 
strategies in leukemia. This technique can be established in every laboratory using 

10 basic methods of molecular biology, and preferably makes use of hybridization 
and amplification such as PCR or LCR based techniques and does not require 
hematologists or cytogeneticists with several years of experience in leukemia 
diagnostics. Material for the analysis can be sent over large distances as it is not 
necessary that cells arrive viable in the laboratory. Therefore, a centralization of 

15 leukemia diagnostics with very high quality is possible. 

1 i 

Moreover, the accumulation of an immense knowledge about gene expression 
profiles in leukemia types and subtypes, which are not characterized by specific 
genetic abnormalities, leads to a more precise classification compared to all other 
methods used so far. In addition, the data compiled in accordance with the 
20 invention are helpful for the understanding of the pathogenesis of leukemia and 
will allow to identify genes which are specifically dysregulated. They may be 
considered as potential targets for therapeutic interventions specifically designed 
for the different leukemia subtypes. 

Preferably the method according to the invention is characterized in that the group 
25 of markers consists of between two, such as three, four, five, six, seven, eight, 
nine or ten and the total number of markers listed in one or more of the tables 3 to 
6, tables 15 to 20, and tables 29, 30, 41, or 42. Most preferred, the group consists 
of all markers listed in one or more tables, whereby the tables are selected from 
the the tables 3 to 6, tables 15 to 20, and tables 29, 30, 41, or 42. The invention 
30 also contemplates that all markers in all tables are analysed. This holds true for 
the presently discussed as well as for embodiments discussed further below. 
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Another embodiment of the invention relates to a method of determining whether 
a patient sample contains leukemia cells or other cells and at the same time or 
subsequently determining the type and subtype of leukemia cells, if leukemia cells 
are present, comprising the steps of determining the expression profile, preferably 
5 the level of expression of a group of markers in a patient sample and concluding 
from the (altered) expression profile i.e. the difference in the level of expression, 
whether the patient sample contains leukemia cells or other cells and at the same 
time determining the type and subtype of leukemia cells, if leukemia cells are 
present, characterized in that the group of markers consists of markers selected 

10 independently from the markers listed in one or more of the tables 16 to 20 or 
table 29 or 30 and whereby the number of markers in the group is between one, 
preferably two such as three, four, five, six, seven, eight, nine or ten and the total 
numBer ofliiarkersnisfed irTone drltiore of the tables 16 to 20 or table 29 or 30. It 
is preferred that the group of markers consists of all markers listed in one or more 

1 5 tables, whereby the tables are selected from the tables 1 6 to 20 or table 29 or 30. 
In a preferred embodiment it is differentiated between four' types of leukemia cells 
and the other cells in the patient sample. The other cells are preferably normal 
cells. 

20 The "other cells" may be, for example, cells affected by a disease which is not a 
leukaemia. It is preferred, in accordance with the present invention that said other 
cells are normal cells, i.e. cells not affected by any disease. 

This embodiment of the present invention allows for the differentiation between 
four different types of leukemias, i.e. AML, CLL, CML and ALL. As has been 

25 surprisingly demonstrated in accordance with the present invention, the qualitative 
and/or quantitative determination of an expression profile of a number of genes „ 
allows the unambiguous classing with any of the above and currently established 
types of leukemias. In principle and more preferred, the relation of the gene 
expression profile to the leukaemia type may take place at the same time at which 

30 the determination of the leukaemia cells in the sample takes place. Alternatively, 
the classification may be effected at a later time point. It was surprising that the 
distinction between the large number of leukemia types and subtypes, including 
the cytogenetically and immunophenotypically defined, as well as types 
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characterized by complex chromosomal aberations, could be accomplished 
preferably by the use of a microarray for the detection of the expression level of a 
marker or a group of markers with such ease and accuracy. In particular, certain 
preferred subsets of genes are provided which can either be used to determine the 
5 leukemia t^pe and subtype, or only determine the subtypes of a certain leukemia 
type or differentiates certain types or subtypes, respectively, from one another. 

In another embodiment a method is disclosed which allows differentiating between 
two types of leukemia cells or one type of leukemia cells and normal cells or non- 
leukemia cells in a patient sample comprising the steps of determining the 

10 expression profile preferably the level of expression, of a group of markers in the 
patient sample and concluding from the (altered) expression profile, i.e. the 
difference in the level of expression, which type of leukemia cells the patient 
sample contains or whether it contains (only) normal cells or non-leukemia cells 
characterized in that the group of markers consists of markers selected 

1 5 independently from the markers listed in one or more of the tables 3 to 6 or tables 
7 to 12 and whereby the number of markers in the group is between one, 
preferably two such as three, four, five, six, seven, eight, nine or ten and the total 
number of markers listed in one or more of the tables 3 to 6 or tables 7 to 12. In a 
preferred embodiment the group of markers consists of all markers listed in one or 

20 more of the tables 3 to 6 or tables 7 to 12. 

In another embodiment of the invention a method is disclosed allowing the 
differentiation between the subtypes of AML cells or between the subtypes of AML 
cells and normal cells in a patient sample comprising the steps of determining the 

25 expression profile, preferably the level of expression of a group of markers in the 
patient sample and concluding from the the (altered) expression profile, i.e. the 
difference in the level of expression, which subtypes of AML cells the patient 
sample contains or whether it contains normal cells characterized In that the group 
of markers consists of markers selected independently from the markers listed in 

30 one or more of the tables 1,2, 13, 14, 17, 25, 27, 35 and 36 and whereby the 
number of markers in the group is between one, preferably two such as three, 
four, five, six, seven, eight, nine or ten and the total number of markers listed in 
one or more of the tables 1, 2, 13, 14, 17, 25, 27, 35 and 36. In a preferred 
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embodiment the group of markers consists of all markers listed in one or more of 
the tables 1 , 2, 13, 14, 17, 25, 27, 35 and 36. It is preferred that three, four or more 
subtypes of AML cells are determined. 

5 In another embodiment- of the invention a method is disclosed allowing the 
differentiation between and thus the determination of the subtypes of ALL cells in a 
patient sample comprising the steps of (a) determining the level of expression of a 
group of markers in the patient sample and (b) concluding from the differences in 
the level of expression which subtypes of ALL cells the patient sample contains 

10 whereby the group of markers consists of markers selected independently from 
the markers listed in one or more of the tables 18, 32 or 33 and whereby the 
number of markers in the group is between one, preferably two such as three, 
four, five, six, seven, eight, nine or ten and the total number of markers listed in 
one or more of the tables 18, 32 or 33. It is preferred that the group of markers 

15 consists of all markers listed in one or more of the tables 18, 32 or 33. 

In another embodiment of the invention a method is disclosed allowing the 
differentiation between and thus the determination of the subtypes of CLL cells in 
a patient sample comprising the steps of determining the level of expression of a 

20 group of markers in the patient sample and concluding from the differences in the 
level of expression which subtypes ,of CLL cells the patient sample contains 
whereby the group of markers consists of markers selected independently from 
the markers listed in one or more of the tables 38 or 39 and whereby the number 
of markers in the group is between one, preferably two such as three, four, five, 

25 six, seven, eight, nine or ten and the total number of markers listed in one or more 
of the tables 38 or 39. It is preferred that the group of markers consists of all 
markers listed in one or more of the tables 38 or 39. 

In another embodiment of the invention, a method is disclosed of assessing the 
30 efficacy of a test compound for inhibiting leukemia, the method comprising 
comparing the expression profile of a group of markers in a first sample obtained 
from the patient and maintained in the presence of the test compound and the 
expression profile of a group of markers in a second sample obtained from the 
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patient and maintained in the absence of the test compound, wherein a 
significantly altered expression profile of the group of markers in the first sample, 
relative to the second sample, is an indication that the test compound is 
efficacious for inhibiting leukemia in the patient characterized in that the group of 

5 markers consists of markers selected independently from the markers listed in one 
or more of the tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 
39, 41, 42 and whereby the number of markers in the group is between one, 
preferably two such as 3, 4, 5, 6, 7, 8, 9 or 10 and the total number of markers 
listed in the tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 

10 41,42. 

In accordance with this embodiment of the present invention, it is again preferred 
That in" the comparison of expression profiles expression levels and differences in 
expression levels are determined and compared. It is further preferred that the 

15 alteration determined in accordance with the method of the invention in the 
expression profile or expression level must be in the direction of the expression 
profile of normal cells or at least diseased but non-leukemic cells. More preferably 
the alteration should be in the direction of normal blood cells, more preferably cells 
of the certain type. Accordingly, it is also preferred that the comparison includes an 

20 internal standard of expression levels of analysed markers wherein the internal 
standard represents the expression profile of non-leukemic and preferably normal 
cells. The comparison may be effected by relying on actual experimental data or 
on in silico obtained reference data. 

25 In another embodiment of the invention a method is disclosed of assessing the 
efficacy of a therapy for inhibiting leukemia in a patient, the method comprising 
comparing the expression profile, preferably the level of expression of a group of 
markers in the first sample obtained from the patient prior to providing at least a 
portion of the therapy to the patient and the expression profile, preferably the level 

30 of expression of a group of markers in a second sample obtained from the patient 
following provision of the portion of the therapy, wherein a significantly (altered) 
expression profile, i.e. a significantly (altered) difference in the level of expression 
of the group of markers in the second sample, relative to the first sample, is an 
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indication that the therapy is efficacious for inhibiting leukemia in the patient 
characterized in that the group of markers consists of markers selected 
independently from the markers listed in one or more of the tables 1 to 20, tables 
25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41 , 42 and whereby the number 
5 of markers in the group is between one, preferably two such as 3, 4, 5, 6, 7, 8, 9 or 
10 and the total number of markers listed in the tables 1 to 20, tables 25 oc 27 or 
tables 29, 30, 32, 33, 35, 36, 38, 39, 41, or 42. 

As with the previous embodiment, the alteration determined in accordance with the 
10 method of the invention in the expression profile or expression level must be in the 
direction of the expression profile or normal cells or at least diseased but non- 
leukemic cells. Accordingly, it is also preferred in accordance with this 
embodiment that the" comparison includes _ an internal -standarad -of expression 
levels of analysed markers wherein the internal standarad represents the 
15 expression profile of non-leukemic and preferably normal cells. The comparison 
may _ again - be effected by relying on actual experimental data or on in silico 
obtained reference data. 

Within the therapy of the patient, compounds may be administered that have at 
20 least passed phase II and preferably are whithin phase III of clinical trials. 
Advantageously, in one embodiment, a therapeutical composition or medicinal 
product is administered that comprises one pharmaceutical^ active compound. In 
alternative embodiments, pharmaceutical compositions or medicinal products are 
administered that comprise more than one pharmaceutical^ active compound. If 
25 the composition or product comprises more than at least one pharmaceutically 
active compound then one of the compounds may aim at the direct reduction of 
tumor load wherein at least one further compound may fulfil an accessory function 
such as the general stimulation of the immune system. Compounds of the latter 
class are also well known in the art and comprise plant derived products as well as 
30 immunostimulatory molecules selected from the group of interleukins, interferons 
and others. 
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Additionally, the invention contemplates a method of refining a compound 
identified by the method as described herein above, said method comprising 
optionally the steps of said methods and: 

(1) identification of the binding sites of the compound and the target molecule 
5 by site-directed mutagenesis or chimeric protein studies; 

(2) molecular modeling of both the binding site of the compound and the 
binding site of the target molecule; and 

(3) modification of the compound to improve its binding specificity for the target. 

10 The target may in accordance with-the above be DNA, mRNA.or .protein. All 
techniques employed in the various steps of the method of the invention are 
conventional or can be derived by the person skilled in the art from conventional 
techniques without further ado. Thus, biological assays based on the herein 
identified nature of the proteins/(poly)peptides may be employed to assess the 

15 specificity or potency of the drugs wherein the increase of one or more activities of 
the proteins/(poly)peptides may be used to monitor said specificity or potency. 
Steps (1) and (2) can be carried out according to conventional protocols. A 
protocol for site directed mutagenesis is described in Ling MM, Robinson BH. 
(1997) Anal. Biochem. 254: 157-178. The use of homology modeling in 

20 conjunction with site-directed mutagenesis for analysis of structure-function 
relationships is reviewed in Szklarz and Halpert (1997) Life Sci. 61:2507-2520. 
Chimeric proteins are generated by ligation of the corresponding DNA fragments 
via a unique restriction site using the conventional cloning techniques described in 
Sambrook (1989), loc. cit.. A fusion of two DNA fragments that results in a 

25 chimeric DNA fragment encoding a chimeric protein can also be generated using 
the gateway-system (Life technologies), a system that is based on DNA fusion by 
recombination. A prominent example of molecular modeling is the structure-based 
design of compounds binding to HIV reverse transcriptase that is reviewed in Mao, 
Sudbeck, Venkatachalam and Uckun (2000). Biochem. Pharmacol. 60: 1251- 

30 1265. 



03039443A2_L> 



WO 03/039443 



PCT/EP02/12303 



19 

For example, identification of the binding site of said drug by site-directed 
mutagenesis and chimerical protein studies can be achieved by modifications in 
the (poly)peptide primary sequence that affect the drug affinity; this usually allows 
to precisely map the binding pocket for the drug. 
5 As regards step (2), the following protocols may be envisaged: Once the effector 
site for drugs has been mapped, the precise residues interacting with different 
parts of the drug can be identified by combination of the information obtained from 
mutagenesis studies (step (1)) and computer simulations of the structure of the 
binding site provided that the precise three-dimensional structure of the drug is 
10 known (if not, it can be predicted by computational simulation). If said drug is itself 
a peptide, it can be also mutated to determine which residues interact with other 
residues in the (poly)peptide of interest. 

Finally, in step' (3) the~drug "can Tbfe modified to~improve its binding affinity or ist 
potency and specificity. If, for instance, there are electrostatic interactions between 
15 a particular residue of the (poly)peptide of interest and some region of the drug 
molecule, the overall charge in that region can be modified to increase that 
particular interaction. 

Identification of binding sites may be assisted by computer programs. Thus, 
appropriate computer programs can be used for the identification of interactive 

20 sites of a putative inhibitor and the (poly)peptide by computer assisted searches 
for complementary structural motifs (Fassina, Immunomethods 5 (1994), 1 14-120). 
Further appropriate computer systems for the computer aided design of protein 
and peptides are described in the prior art, for example, in Berry, Biochem. Soc. 
Trans. 22 (1994), 1033-1036; Wodak, Ann. N. Y. Acad. Sci. 501 (1987), 1-13; 

25 Pabo, Biochemistry 25 (1986), 5987-5991. Modifications of the drug can be 
produced, for example, by peptidomimetics and other inhibitors can also be 
identified by the synthesis of peptidomimetic combinatorial libraries through 
successive chemical modification and testing the resulting compounds. Methods 
for the generation and use of peptidomimetic combinatorial libraries are described 

30 in the prior art, for example in Ostresh, Methods in Enzymology 267 (1996), 220- 
234 and Dorner, Bioorg. Med. Chem. 4 (1996), 709-715. Furthermore, the three- 
dimensional and/or crystal lographic structure of activators of the expression of the 
(poly)peptide of the invention can be used for the design of peptidomimetic 
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activators, e.g., in combination with the (poiy)peptide of the invention (Rose, 
Biochemistry 35 (1996), 12933-12944; Rutenber, Bioorg. Med. Chem. 4 (1996), 
1545-1558). 

In accordance with the above, in a preferred embodiment of the method of the 
5 invention said at least one compound is further refined by peptidomimetics. 

The invention furthermore relates to a method of modifying a compound identified 
or refined by the method as described herein above as a lead compound to 
achieve (i) modified site of action, spectrum of activity, organ specificity, and/or (ii) 
improved potency, and/or (iii) decreased toxicity (improved therapeutic index), 
and/or (iv) decreased side effects, and/or (v) modified onset of therapeutic action, 
duration of effect, and/or (vi) modified pharmakinetic parameters (resorption, 
distribution, metabolism" and excretion), and/or (vii) modified physico-chemical 
parameters (solubility, hygroscopicity, color, taste, odor, stability, state), and/or 
(viii) improved general specificity, organ/tissue specificity, and/or (ix) optimized 
application form and route by (i) esterification of carboxyl groups, or (ii) 
esterification of hydroxyl groups with carbon acids, or (iii) esterification of hydroxyl 
groups to, e.g. phosphates, pyrophosphates or sulfates or hemi succinates, or (iv) 
formation of pharmaceutical^ acceptable salts, or (v) formation of 
pharmaceutical^ acceptable complexes, or (vi) synthesis of pharmacologically 
active polymers, or (vii) introduction of hydrophylic moieties, or (viii) 
introduction/exchange of substituents on aromates or side chains, change of 
substituent pattern, or (ix) modification by introduction of isosteric or bioisosteric 
moieties, or 

(x) synthesis of homologous compounds, or (xi) introduction of branched side 
25 chains, or (xii) conversion of alkyl substituents to cyclic analogues, or (xiii) 
derivatisation of hydroxyl group to ketales, acetales, or (xiv) N-acetylation to 
amides, phenylcarbamates, or (xv) synthesis of Mannich bases, imines, or (xvi) 
transformation of ketones or aldehydes to Schiff's bases, oximes, acetales, 
ketales, enolesters, oxazolidines, thiozolidinesor combinations thereof; said 
30 method optionally further comprising the steps of the above described methods. 
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The various steps recited above are generally known in the art. They include or 
rely on quantitative structure-action relationship (QSAR) analyses (Kubinyi, 
"Hausch-Analysis and Related Approaches", VCH Verlag, Weinheim, 1992), 
combinatorial biochemistry, classical chemistry and others (see, for example, 
5 Holzgrabe and Bechtold, Deutsche Apotheker Zeitung 140(8), 813-823, 2000). 



The invention moreover relates to a method of producing a pharmaceutical 
composition comprising optionally the steps of the aforementioned methods and 
further the step of formulating the at least one compound identified, refined or 

10 modified by the method of any of the preceding embodiments with a 
pharmaceutical ly active carrier or diluent. 
-The pharmaceutical composition- produced in accordance with the- present 
invention may further comprise a pharmaceutical^ acceptable carrier and/or 
diluent and/or excipient. Examples of suitable pharmaceutical carriers are well 

15 known in the art and include phosphate buffered saline solutions, water, 
emulsions, such as oil/water emulsions, various types of wetting agents, sterile 
solutions etc. Compositions comprising such carriers can be formulated by well 
known conventional methods. These pharmaceutical compositions can be 
administered to the subject at a suitable dose. Administration of the suitable 

20 compositions may be effected by different ways, e.g., by intravenous, 
intraperitoneal, subcutaneous, intramuscular, topical, intradermal, intranasal or 
intrabronchial administration. The dosage regimen will be determined by the 
attending physician and clinical factors. As is well known in the medical arts, 
dosages for any one patient depends upon many factors, including the patient's 

25 size, body surface area, age, the particular compound to be administered, sex, 
time and route of administration, general health, and other drugs being 
administered concurrently. A typical dose can be, for example, in the range of 
0.001 to 1000 jLxg (or of nucleic acid for expression or for inhibition of expression in 
this range); however, doses below or above this exemplary range are envisioned, 

30 especially considering the aforementioned factors. Generally, the regimen as a 
regular administration of the pharmaceutical composition should be in the range of 
1 M.g to 10 mg units per day. If the regimen is a continuous infusion, it should also 
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be in the range of 1 jug to 10 mg units per kilogram of body weight per minute, 
respectively. Progress can be monitored by periodic assessment. Dosages will 
vary but a preferred dosage for intravenous administration of DNA is from 
approximately 10 6 to 10 12 copies of the DNA molecule. The compositions of the 
5 invention may be administered locally or systemically. Administration will generally 
be parenterally, e.g., intravenously; DNA may also be administered directly 4o the 
target site, e.g., by biolistic delivery to an internal or external target site or by 
catheter to a site in an artery. Preparations for parenteral administration include 
sterile aqueous or non-aqueous solutions, suspensions, and emulsions. Examples 

10 of non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils 
such as olive oil, and injectable organic esters such as ethyl oleate. Aqueous 
carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, 
including saline and buffered media. Parenteral vehicles include sodium chloride 
solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or 

15 fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte 
replenishers (such as those based on Ringer's dextrose), and the like. 
Preservatives and other additives may also be present such as, for example, 
antimicrobials, anti-oxidants, chelating agents, and inert gases and the like. 
Furthermore, the pharmaceutical composition of the invention may comprise 

20 further agents such as interleukins or interferons depending on the exact intended 
use of the pharmaceutical composition. 

The above methods referring to downstream developments also apply to 
therapeutically effective compounds referred to in additional embodiments herein 
below. 

25 In another embodiment of the invention a method is disclosed of selecting a 
composition for inhibiting leukemia in a patient, the method comprising separately 
maintaining aliquots of cells of a patient sample in the presence of a plurality of 
test compositions, comparing the expression profile, preferably the level of 
expression of a group of markers in each of the aliquots, and selecting one of the 

30 test compositions which induces an altered expression profile of the group of 
markers in the aliquot containing that test composition, relative to other test 
compositions characterized In that the group of markers consists of markers 
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selected independently from the markers listed in one or more of the tables 1 to 
20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41 ,' 42 and whereby the 
number of markers in the group is between one, preferably two such as 3, 4, 5, 6, 
7, 8, 9 or 10 and the total number of markers listed in the tables 1 to 20, tables 25 
5 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41 , 42. 

Again, as with the previously recited embodiments, the alteration determined in 
accordance with the method of the invention in the expression profile or 
expression level must be in the direction of the expression profile of normal cells or 

10 at least diseased but non-leukemic cells. Accordingly, it is also preferred in 
accordance with this embodiment that the comparison includes an internal 
standarad of expression levels of analysed markers wherein the internal standarad 
represents the expression prof We of non-leukemic and preferably normal cells. The 
comparison may - again - be effected by relying on actual experimental data or 

15 on in silico obtained reference data. 

The expression "in the direction of the expression profile of normal cells" as used 
herein preferably relates to cells that comprise blood cells, more preferably a 
single type of blood cells. Most preferably, the single type of cells corresponds to 

20 the type of the leukemic cell. For example, an AML type of leukemic cell would 
preferably be compared to a healthy myeloic blast cell whereas a ALL type of 
leukemic cell would preferably be compared to a healthy lymphatic blast cell. 
Myeloic blast cells and lymphatic blast cells may be isolated from healthy bone 
marrow using well known methods, such as cell sorting based on flow cytometry 

25 using established cell surface markers. 

In this method of the invention, it is preferred that the test composition comprises 
only one putatively active test compound. Insofar, the correlation with the activity 
of the test compound and the readout is particularly convenient. If the test 
30 composition comprises more than one putatively pharmaceutical^ active 
compounds, it may be considered to separately test each compound in a 
composition that has tested positive in a first round of the assay. Consequently, in 
a second round, i.e. in a repetition of steps (a) and (b), the various compositions 
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tested positive, if any, in the first round, may be subdivided into single compounds 
and these single compounds tested again for their efficacy. The goal of such an 
approach, of course, is to obtain a composition comprising a single active 
compound only. 

5 

In another embodiment a method of determining new subtypes of leukemia cells is 
diclosed, the method comprising determining, the expression profile, preferably the 
level of expression of a group of markers of leukemia cells of unknown subtype, 
comparing the expression profile to the level of expression, ie. the expression 

10 profile, of a group of markers of leukemia cells of known subtype, thereby 
concluding that a new subtype is determined when the expression profile, 
preferably the level of expression is different to all known subtypes characterized 
in that the group of markers consists of markers selected independently from the 
markers listed in one or more of the tables 1 to 20, tables 25 or 27 or tables 29, 

15 30, 32, 33, 35, 36, 38, 39, 41, 42 and whereby the number of markers in the 
group is between one, preferably two and the total number of markers listed in the 
tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41 , 42. 

The term "subtype of leukemia cells" in accordance with the present invention may 
20 be better understood in accordance with the following Leukemias are subdivided 
according to their natural clinical course into acute and chronic leukemias. Based 
on the cell line they are derived from they are further subdivided into myeloid and 
lymphatc leukemias. This results in four leukemia types, i.e. acute myeloid 
leukemia (AML), acute lymphoblastic leukemia (ALL), chronic myeloid leukemia 
25 (CML), and chronic lymphatic leukemia (CLL). Based on genetic, phenotypic, and 
biological characteristic, which are assessed by cytomorphology, cytochemistry, 
cytogenetics, immunophenotyping, and molecular genetics, AML, ALL, and CLL 
are further subdivided into subtypes. These subtypes are associated with highly 
differing prognoses. Treatment approaches specific for these subtypes are applied 
30 and are being further optimized. Thus, an exact diagnosis based on a reliable and 
reproducible method is essential for the selection of the appropriate subtype- 
specific treatment. 



BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



25 

The new subtypes identified in accordance with the invention may then be 
subjected in the same or in further patients to the other methods/embodiments of 
the invention. 

5 In another embodiment a method is disclosed for guiding the therapy of leukemia 
in a patient depending on the leukemia subtype and/or the risk of relapse of 
disease, the method comprising determining the expression profile, preferably the 
level of expression of a group of markers in the patient sample, and deciding about 
the therapy strategy depending on the leukemia subtype or the risk of relapse of 

10 disease characterized in that the group of markers consists of markers selected 
independently from the markers listed in one or more of the 1 to 20, tables 25 or 
27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42 and whereby the number of 
markers in the group is between one, preferably two such as 3, 4, 5; 6, 7, 8,.9 or 
10 and the total number of markers listed in the tables 1 to 20, tables 25 or 27 or 

1 5 tables 29, 30, 32, 33, 35, 36, 38, 39, 41 ,42. 

i 

This embodiment is particularly important for the quick and reliable recovery of the 
patient from the leukemia that effects him or her. As has been stated above, the 
early and reliable diagnosis of the leukaemia type or subtype is particularly 

20 important for the instigation of a useful and straightforward treatment regimen. An 
incorrect diagnosis may result in the application of a wrong treatment regimen 
which, in turn, may lead to significant health risks including premature death of the 
patient. In accordance with the present invention, a reliable means has been 
provided that, based on the inventive selection of markers provided, will overcome 

25 the prior art problems of an insecure or an inappropriate time frame demanding 
diagnosis. In particular, the present method of the invention provides in step (a) an 
unambiguous and safe basis for the decision step (b). Again, the patient may 
safely rely on the conclusion drawn in step (b) due to the strong inherent 
correlation that has been achieved between the selection of markers and the 

30 leukemia subtype. The relation of tables to leukemia subtypes has also been 
demonstrated elsewhere in this specification. 
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In another embodiment of the invention, a method for monitoring the progression 
of leukemia in a patient is disclosed, the method comprising determining the the 
expression profile, preferably the level of expression of a group of markers in a 
patient sample at a first point in time, and repeating this step at a subsequent point 

5 in time; and comparing the expression profile, preferably the level of expression 
detected in the previous steps and therefrom monitoring the progression of 
leukemia in the patient, characterized In that the group of markers consists of 
markers selected independently from the markers listed in one or more of the 
tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42 and 

10 whereby the number of markers in the group is between one, preferably two such 
as 3, 4, 5, 6, 7, 8,.9 or 10 and the total number of markers listed in the tables 1 to 
20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42. In a preferred 
embodiment, the patient hasundergone- chemotherapy between the first point in 
time and the subsequent point in time (including repetitions of step (b). 

15 

In this embodiment of the present invention, the skilled artisan may repeat step (b) 
one or more times in order to collect additional data from different (more) time 
points. The additional data obtained by such further measurements may provide 
an overall better overview on the progress of the disease. 
20 In accordance with this embodiment of the disease, the term "progression of 
leukemia" includes the interpretation of "regression of leukemia", i.e. includes the 
interpretation of a negative progression. This is of course in line with the aim of the 
therapy and the desire of the patient. 

25 In the methods according to the invention it is preferred that the group of markers 
consists of markers selected independently from the markers listed in one or more 
of the tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41 , 42 
and whereby the number of markers in the group is between one, preferably two 
and the total number of markers listed in the at least one of tables 1 to 20, tables 

30 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42. In a preferred 
embodiment, the number of markers in the group is between five, more preferably 
between 7, 10 or 15 and the total number of markers listed in the tables 1 to 20, 
tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41 , 42. It is feasible that the 
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group of markers not only consists of those markers but also comprises them as 
the data will then be still statistically significant, i.e. the preferred groups may 
additionally contain 10, 50 or 100 other markers and comprise the other markers 
according to the invention and mentioned above. It is, however, also feasible for 
5 the expert skilled in the art that only a single suitable marker is determined with the 
methods according to the invention. 

Particulary preferred markers used in a method where only one or a few as e.g. 
one, preferably two markers are used are described in Table 22 and Example 3, 
Fig. 12 or the markers marked with an asterisk in table 20 and shown in tables 16 
10 to 19 as the preferred set of markers. In detail, example 3 mentions (see example 
3 for more details) the following markers including their expression level: 

ADCY3 

adenosine deaminase (ADA) 
ARGHGAP4 

B-cell specific coactivator of octamer binding transcription factors 
CAPN3 is a member of the papain superfamily and was higher expressed in 
CML 

CBF3-MYH11 
CD24 

CD27, was identified to assign samples either ALL or CLL 
CD74 plays a critical role in MHC class II antigen processing 
connective tissue growth factor (CTGF) 
CTGF 
CTSW 
MYH11 

glucocorticoid receptor beta 
higher expression of CBFA2T1 (formerly ETO) 
HLA-DMB 
HOXA9 
HOXB5 

IRF4, an immune system-restricted interferon regulatory factor 
KIAA1013 
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• LCN2 that shown to be a modulator of inflammation 

• LEF-1 was absent in myeloid leukemias but highly expressed in lymphoid 
leukemias 

• MBNL 

5 • MSF translocation partner of the mixed-lineage leukemia gene (MLL) in 
AML 

• NCOA1 expressed higher in CLL as compared to ALL 

• OS-9differentially expressed between AML and ALL (14) 

• Phospholipidscramblase 1 (PLSCR1) to be lower expressed in AML and 
1 0 ALL as compared to normal BM 

. POU2AF1 

• POU2F2 

• POU4F1 

• SCYA3 
15 • SGP28 

• SOCS-2 

• TRB and CD3D 



Particulary preferred markers used in a method where only one or a few as e.g. 

20 one, preferably two markers are used are described in tables 30, 33, 36 and 42 
and Example 7, Figures 189 to 234, 254 to 272, 338 to 371, 433 to 465, 
respectively, or the markers marked with an asterisk in tables 29, 32, 35, 38, and 
41 and Figures 24 to 188, 235 to 253, 273 to 337, 372 to 405, 406 to 432, 
respectively as the preferred set of markers. In detail, example 7 mentions (see 

25 example 7 for more details) the following markers including their expression level: 



genelD 


gene symbol 


feature 


201162_at 


IGFBP7 


CLL low 


201163_s_at 


IGFBP7 


CLL low 


201362_at 


NS1-BP 


CML high 
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201496__x_at 


MYH11 


AML inv(16) high 


201497_x_at 


MYH11 


AML inv(16) high 


201998_at 


SIAT1 


CLL high 


202095_s_at 


BIRC5 


CLL low 


203074_at 


ANXA8 


AMLt(15;17) high 


204150_at 


STAB1 


AMLt(15;17) high 


20451 1_at 


KIAA0793 


CLL high 


205528_s_at 


CBFA2T1 


AMLt(8;21) high 


205529_s_at 


CBFA2T1 


AMLt(8;21) high 


205805_s_at 


ROR1 


CLL high 


206940_s_at 


POU4F1 


AMLt(8;21) high 


20781 9_s_at 


ABCB4 


CLL high 


208091_s_at 


DKF2P564K0822 

i 


CLL high 


208456_s_at 


RRAS2 


CLL high 


209061_at 


NCOA3 


CLL high 


209101_at 


CTGF 


ALL t(4;11) high, ALL Ph 
high, T-ALL high 


209374_s_at 


IGHM 


CLL high 


20961 6_s_at 


CES1 


AML MLL high 
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210997_at 


HGF 


AML t(15;17) high 


212285_s_at 


AGRN 


AMLt(15;17) high 


213539_at 


CD3D 


T-ALL high 


214450_at 


CTSW 


• 

AMLt(15;17) high 


215925_s_at 




ALL t(4;11) high 


218223_s_at 


LOC51177 


CML low 


222166_at 




AML +8 high 


224520_s_at 


MGC13168 


ALL t(8;14) high 


224794_s_at 


LOC51148 


AMLt(15;17) high 


225660__at 


SEMA6A 


ALL B not Ph high, ALL 
Ph high 


226496_at 


Homo sapiens, Similar to hypothetical protein FLJ22611, 
clone MGC:24716 IMAGE:4277726, mRNA, complete cds 


ALL high, CLL high 


228827_at 


Homo sapiens clone 25023 mRNA sequence 


AML t(8;21) high 
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228904_at £ 


ESTs ' 

I 


\ML normal high, AML +8 

i 

nigh, AML complex high 


236301 _at f 


Homo sapiens, clone IMAGE:3866403, mRNA 


DLL high 


236892_s_at 


HOXB6 


AML normal high, AML +8 
high, AML complex high 


23921 4_at 


ESTs 


ALL t(4;11) high 


239393_at 


ESTs 

\ x 


MLL l^**, I I } illy II . 


239791_at 


HOXB6 


AMI normal hinh AMI 4-R 

r\IVlL_ llDlllldl 1 llyll, AIVIU tu 

high 


240581_at 


ESTs 


ALL t(4;11)high 


241464_s_at 


ESTs 

i 


AML MLL high, AML 
normal high, AML +8 high, 
AML complex high 


241525_at 


ESTs 


AML inv(16) high 


243362_s_at 


LEF1 


ALL high, CLL high 


36566_at 


CTNS 


T-ALL low 


38487_at 


FLJ12442 


AMLt(15;17) high 


201105_at 


LGALS1 


ALL t(4;11) high 
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204044_at 


QPRT 


ALL t(4;11) high 


205899_at 


CCNA1 


ALL t(4;11) high 


209168_at 


GPM6B 


ALL t(4;11) high 


213539_at 


CD3D 


T-ALL high 


213894_at 


KIAA0960 


ALL t(4;11) high 


215925_s_at 




ALLt(4;11) high 


218224_at 


PNMA1 


T-ALL high 


219463_at 


C20orf103 


ALL t(4;11)high 


219631_at 


FLJ12929 

• 


T-ALL high 


225563_at 


ESTs 


ALL t(4;11) high 


225592_at 


NRM 


ALL t(4;11) high 


228083_at 


Homo sapiens mRNA; cDNA DKF2p434M216 (from clone 
DKFZp434l1216) 


ALL t(4;11) high 


228988_at 


ZNF6 


T-ALL high 


235749_at 




ALL t(8;14) high 


24241 4_at 


ESTs 


ALL t(4;11)high 


243756_at 


ESTs 


ALL t(4;11) high 
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201497_x_at 


MYH11 


AML inv(16) high 


228827_at 


Homo sapiens clone 25023 mRNA sequence 


AML t(8;21) high 


38487_at 


FU 12442 


AML t(15;17) high 


203074_at 


ANXA8 


AML t(15;17) high 


205528_s_at 


CBFA2T1 


AML t(8;21)high 


205529_s_at 


CBFA2T1 


AMLt(8;21) high 


206940_s_at 


POU4F1 


AML t(8;21) high 


211341_at 


POU4F1 


AMLt(8;21) high 


201496_x_at 


MYH11 


AML inv(16) high 


228660_x_at 


SEMA4F 


other high 


20271 8_at 


IGFBP2 


AML t(15;17) high 


205380_at 


PDZK1 


other high 


202746_at 




AML MLL low 


201596_x_at 


KRT18 


AMLt(8;21) low 


3421 0_at 


CDW52 


AMLt(15;17) low 


212850_s_at 


LRP4 


AML inv(16) high 
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228904 at 


ESTs 


AML t(8;21) low, AML 
t(15;17) low, AML inv(16) 
low, AML MLL low 


203151_at 


MAPI A 


AML t(8;21)low 


201137_s_at 


HLA-DPB1 


AML t(15;17) low 


200675_at 


CD81 


AML inv(16) low 


201425_at 


ALDH2 


AMLt(8;21)low 


202085_at 


TJP2 


AML inv(16) low 


20261 9_s_at 


PLOD2 


AML MLL low 


203092_at 


TIMM44 


AML inv(16) low 


204425_at 


ARHGAP4 


AMLt(15;17) low 


205366 s at 


HOXB6 


AML t(8;21) low, AML 
t(15;17) low, AML inv(16) 
low, AML MLL low 


205472_s_at 


DACH 


AML MLL high 


206761 _at 


TACTILE 


AML MLL low 


222166_at 




AML +8 low 


222335_at 


ESTs 


AML MLL low 


22331 8_s_at 


MGC10974 


AML complex low 
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225330_at 


Homo sapiens, done MGC:18216 !MAGE:41 56235, mRNA, 
complete cds 


AML inv(16) low 
i 


231277_x_at 


ESTs 


AML complex low 


635_s_at 


PPP2R5B 


other low 


202503_s_at 


KIAA0101 


CLL low 


202580_x_at 


FOXM1 


CLL low 


202709_at 


FMOD 


CLL high 


204882_at 


KIAA0053 


CLL high 


205049_s_at 


CD79A 


ALL high, CLL high 


205051 _s_at 


KIT 


AML high 


205382_s_at 


DF 


AML high 


205599_at 


TRAF1 


CML low CLL high 


206255_at 


BLK 


ALL high, CLL high 


206398_s_at 


CD19 


ALL high, CLL high 


210487_at 


DNTT 


ALL high 


210948_s_at 


LEF1 


ALL high, CLL high 


211352_s_at 


NCOA3 


CLL high 
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211404_s_at 


APLP2 


AML high 


214761_at 


0A2 


ALL high 


217950_at 


NOSIP 

• 


CLL high 


218090_s_at 




CLL high 


218516_s_at 


FLJ20421 


normal BM low 


218916_at 


FLJ23436 


normal BM low 


219753_at 


STAG3 


ALL high 


221969_at 


PAX5 


Aim i • i ■ i_ * i 

ALL high, CLL high 


223703_at 


CDA017 

i 


AML high, CML high, 
normal BM high 


226147_s__at 


Homo sapiens cDNA: FU22667 fis, clone HSI08385 


CLL high 


228471 _at 


ESTs 


CLL high 


229487_at 


ESTs 


ALL high 


229790_at 


TERF2 


CML low, BM low 


231736_x_at 


MGST1 


AML high, CML high, 
normal BM high 


231 854_at 


Homo sapiens cDNA FLJ11448 fis, clone HEMBA1001391 


CML low 
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239287_at 


ESTs 


CLL high 
i 


243362_s__at 


LEF1 


ALL high 


243363_at 


LEF1 


ALL high, CLL high 


41577_at 


PPP1R16B 


CML low 



Preferred methods for detection and quantification of the amount of nucleic acids, 
i.e. for the methods according to the invention allowing the determination of the 
level of expression of a marker or a group of markers, are those described by 
5 Sambrook et al. (1989) or real time methods known in the art as the TaqMan® 
method disclosed in WO92/02638 and the corresponding US patents US 
5,210,015, US 5,804,375, US 5,487,972. This method exploits the exonuclease 
activity of a polymerase to generate a signal. In detail, the (at least one) target 
nucleic acid component is detected by a process comprising contacting the 

10 sample with an oligonucleotide containing a sequence complementary to a region 
of the target nucleic acid component and a labeled oligonucleotide containing a 
sequence complementary to a second region of the same target nucleic acid 
component sequence strand, but not including the nucleic acid sequence defined 
by the first oligonucleotide, to create a mixture of duplexes during hybridization 

15 conditions, wherein the duplexes comprise the target nucleic acid annealed to the 
first oligonucleotide and to the labeled oligonucleotide such that the 3'-end of the 
first oligonucleotide is adjacent to the 5'-end of the labeled oligonucleotide. Then 
this mixture is treated with a template-dependent nucleic acid polymerase having a 
5' to 3' nuclease activity under conditions sufficient to permit the 5' to 3' nuclease 

20 activity of the polymerase to cleave the annealed, labeled oligonucleotide and 
release labeled fragments. The signal generated by the hydrolysis of the labeled 
oligonucleotide is detected and/ or measured. TaqMan® technology eliminates the 
need for a solid phase bound reaction complex to be formed and made detectable. 
Other methods include e.g. fluorescence resoance energy transfer between two 

25 adjacenly hybridized probes as used in the LightCycler® format described in US 
6,174,670. 
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Protocols for carrying out the methods according to the invention are known to the 
expert in the field and are described in the examples, preferably in example 1 and 
4. A preferred protocol is described in Example 1(A), where total RNA is isolated, 
cDNA synthesized and biotin incorporated during the transcription reaction. The 
5 purified cDNA was applied to commercially available arrays which can be obtained 
e.g. from Affymetrix. The hybridized cDNA is detected according to the methods 
described in Example 1(A). The arrays are produced by photolithography or other 
methods known to experts skilled in the art e.g. from US5,445,934, US5,744,305, 
US5,700,637, US5,945,334 and EP619 321 or EP 373 203. The latter methods 

10 are also suitable for producing the composition according to the inventions in 
particular the composition wherein polynucleotides or oligonucleotides are bound 
to a solid phase in particular in the form of arrays. In a further preferred 
embodiment of the methods according to the invention, a transcribed 
polynucleotide or portion thereof is the marker or at least one of the markers. A 

15 particularly preferred transcribed polynucleotide is an mRNA or a cDNA. In a 
preferred embodiment of the methods according to the! invention, the step of 
determining the expression profile further comprises amplifying the transcribed 
polynucleotide. In another preferred embodiment, the level of expression, i.e. the 
expression profile, of the group of transcribed polynucleotides is determined by 

20 annealing the transcribed polynucleotides with a complementary polynucleotide or 
a portion thereof under stringent hybridization conditions. The term "stringent 
hyberidisation conditions" is equivalent to the term "highly stringent hyberdisation 
conditions". Such highly stringent hybridization conditions may be determined in 
accordance with the teachings provided in Hames and Higgins (eds) "Nucleic acid 

25 hybridization, a practical approach", IRL Press 1985, Oxford, and include 
hybridization at 55-65°C in 0.2-0.5xSSC, 0.1% SDS followed by appropriate 
washing conditions such as 0.5-1 xSSC at 55°C and 0.1% SDS. 

In a most preferred embodiment, the patient sample is blood, i.e. blood 
30 mononuclear cells, or bone marrow, i.e. mononuclear cells. The methods 
according to the invention may be performed on fresh or frozen blood, i.e. blood 
mononuclear cells or bone marrow, i.e. mononuclear cells. 
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In a preferred embodiment the marker or at least one of the markers is a protein. 
In another preferred embodiment the expression profile of the proteins is detected 
using a reagent which specifically binds to one of the proteins whereby preferably 
the reagent is selected from the group consisting of an antibody, an antibody 
5 derivative, andan antibody fragment. 

The term "antibody" comprises monoclonal antibodies as first described by Kohler 
and Milstein in Nature 278 (1975), 495-497 as well as polyclonal antibodies, i.e. 
entibodies contained in a polyclonal antiserum. Monoclonal antibodies include 
10 those produced by transgenic mice. Fragments of antibodies include F(ab')2, Fab 
and Fv fragments. Derivatives of antibodies include scFvs, chimeric and 
humanized antibodies. See, for example Harlow and Lane, loc. c\i. 

Another embodiment of the invention is a kit preferably for assessing the suitability 

15 of each of a plurality of compounds for inhibiting leukemia in a patient, the kit 
optionally comprising the plurality of compounds; and a reagent for assessing the 
expression profile of a group of markers characterized in that the group of markers 
consists of markers selected independently from the markers listed in one or more 
of the tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42 

20 and whereby the number of markers in the group is between two and the total 
number of markers listed in the tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 
33, 35, 36, 38, 39, 41 , 42. Another embodiment is a kit preferably for assessing 
whether a patient is afflicted with leukemia, the kit comprising reagents for 
assessing the expression profile of a group of markers characterized In that the 

25 group of markers consists of markers selected independently from the markers 
listed in one or more of the tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 
35, 36, 38, 39, 41 , 42 and whereby the number of markers in the group is between 
two and the total number of markers listed in the tables 1 to 20, tables 25 or 27 or 
tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42. Another embodiment is a kit 

30 preferably for assessing the presence of human leukemia cells, the kit comprising 
an antibody, wherein the antibody specifically binds with a protein corresponding 
to a marker characterized In that the marker is selected from the tables 1 to 20, 
tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42. Another 
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embodiment is a kit preferably for assessing the leukemia cell carcinogenic 
potential of a test compound, the kit comprising leukemia cells and a reagent for 
assessing expression of a marker, wherein the marker is selected from the tables 
1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41 , 42. 

5 

Advantageously, the kit of the present invention further comprises, optionally (a) 
storage solution(s) and/or remaining reagents or materials required for the conduct 
of scientific and/or diagnostic and/or therapeutic methods. Furthermore, parts of 
the kit of the invention can be packaged individually in vials or bottles or in 
1 0 combination in containers or multicontainer units. 

Another embodiment of the invention is related to a protein or the RNA, cDNA or 

CRNA corresponding to a marker-selected from the tables 1 to 20, tables 25 or 27 

or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42 or the use thereof for the treatment 
15 of or vaccination against leukemia. Alternatively and depending on the exact 
purpose, inhibitors of these compounds such as antibodies, fragments or 
derivatives thereof may be employed for said purpose. 

The invention also contemplates a method for the development or preparation of 
20 a pharmaceutical composition for the treatment of leukemia characterized in that a 
protein corresponding to a marker selected from the tables 1 to 20, tables 25 or 27 
or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42 is admixed with pharmaceutical 
compounds. Another embodiment of the invention is related to a method for the 
development or preparation of a pharmaceutical composition for the treatment of 

25 leukemia characterized in that a vector comprising a polynucleotide encoding a 
protein corresponding to a marker selected from the tables 1 to 20, tables 25 or 27 
or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42 is admixed with pharmaceutical 
compounds. Another embodiment of the invention is a method for the 
development or preparation of a pharmaceutical composition for the treatment of 

30 leukemia characterized in that an antisense oligonucleotide complementary to a 
polynucleotide encoding a protein corresponding to a marker selected from the 
tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42 is 
admixed with pharmaceutical compounds. Alternatively, inhibitors such as 
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antibodies specific for the markers may be used for the preparation or 
development of a pharmaceutical composition. 

The term "pharmaceutical compounds" is preferably to be understood to mean 
5 pharmaceutical^ acceptable carriers, diluents or excipients, only in connection 
with the embodiments recited in this paragraph. In another embodiment of the 
invention a marker or a group of markers selected individually from one or more of 
the tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41 , 42 
is used for the determination of leukemia cells, the type or subtype of leukemia 
10 cells. 

In another embodiment, of the invention a marker or a group of markers selected 
individually from one or more of the tables 1, 2, 13, 14, 17, 25, 27, 35 or 36 is used 
for the determination of the subtype of AML cells. 

15 

In a preferred embodiment, the invention is related to a composition comprising a 
group of markers and substances chemically different to the markers 
characterized in that the group of markers consists of markers selected 
independently from the markers listed in one or more of the tables 1 to 20, tables 

20 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42 and whereby the number 
of markers in the group is between one, preferably two and the total number of 
markers listed in the tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 
38, 39, 41 , 42. It is preferred that the composition according to the invention is 
characterized in that the group of markers consists of all markers listed in one or 

25 more of the tables 1 to 20, tables 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 
41 , 42. More preferred the composition according to the invention is characterized 
in that the group of markers consists of all markers listed in one or more of the 
tables 14, tables 16 to 20, or table 29 or 30, most preferred the group of markers 
consists of all markers listed in the tables 16 to 20 or tables 29 or 30. Preferably 

30 the markers are polynucletides or oligonucleotides, whereby the polynucleotides 
are bound to a solid phase in the form of an array. 
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The present invention also relates to a method of determining the subtypes of ALL 
cells in a patient sample comprising the steps of a) determining the level of 
expression of a group of markers in the patient sample and b) concluding from the 
differences in the level of expression which subtypes of ALL cells the patient 
5 sample contains characterized in that the group of markers consists of markers 
selected independently from the markers listed in one or more of the tables t8, 32 
or 33 and whereby the number of markers in the group is between two and the 
total number of markers listed in the tables 18, 32 or 33. 

10 Preferably the group of markers consists of all markers listed in one or more of the 
tables 18, 32 or 33. 

—The-present-invention-further-relates to a method of determining the subtypes of 
CLL cells in a patient sample comprising the steps of a) determining the level of 

15 expression of a group of markers in the patient sample and b) concluding from the 
differences in the level of expression which subtypes of CLL cells the patient 
sample contains characterized in that the group of markers consists of markers 
selected independently from the markers listed in one or more of the tables 38 or 
39 and whereby the number of markers in the group is between two and the total 

20 number of markers listed in the tables 38 or 39. 

It is preferred that the group of markers consists of all markers listed in one or 
more of the tables 38 or 39. 

The present invention is also related to a diagnostic composition comprising at 
25 least one nucleic acid molecule, preferably (a) single-stranded nucleic acid 
molecule(s), which is capable of specifically hybridizing to the mRNA of at least 
one gene listed in Table 1. The use of said nucleic acid molecules for diagnosis of 
leukemia subtypes, preferably based on microarray technology, offers the 
following advantages: (1 ) more rapid and more precise diagnosis, (2) easy to use 
30 in laboratories without specialized experience, (3) abolishes the requirement for 
analyzing viable cells for chromosome analysis (transport problem), (4) very 
experienced hematologists for cytomorphology and cytochemistry, 
immunophenotyping as well as cytogeneticists and molecularbiologists are no 
longer required, and (5) improves the subclassification of leukemia due to the 
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definition of new entities based on gene expression profiles in those subtypes that 
are not clearly defined with the methods of the prior art (class discovery). 

As used herein, the term "capable of specifically hybridizing" has the meaning of 
hybridization under conventional hybridization conditions, preferably under 
5 stringent conditions as described, for example, in Sambrook, J., et al., in 
"Molecular Cloning: A Laboratory Manual" (1989), Eds. J. Sambrook, E. F. Fritsch 
and T. Maniatis, Cold Spring Harbour Laboratory Press, Cold Spring Harbour, NY 
and the further definitions provided above. Also contemplated are nucleic acid 
molecules that hybridize at lower stringency hybridization conditions. Changes in 

10 the stringency of hybridization and signal detection are primarily accomplished 
through the manipulation, preferably of formamide concentration (lower 
percentages of formamide result in lowered stringency), salt conditions, or 
temperature. For example, lower stringency conditions include an overnight 
incubation at 37°C fn a solution comprising 6X SSPE (20X SSPE = 3M NaCI; 0.2M 

15 NaH2P04; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 mg/ml salmon 
sperm blocking DNA, followed by washes at 50°C with 1 X SSPE, 0.1% SDS. In 
addition, to achieve even lower stringency, washed performed following stringent 
hybridization can be done at higher salt concentrations (e.g. 5x SSC). Variations in 
the above conditions may be accomplished through the inclusion and/or 

20 substitution of alternate blocking reagents used to suppress background in 
hybridization experiments. The inclusion of specific blocking reagents may require 
modification of the hybridization conditions described above, due to problems with 
compatibility. 

As a hybridization probe (or primer) nucleic acid molecules can be used, for 
25 example, that have exactly or basically the nucleotide sequence of at least one of 
the genes depicted in the appended tables or parts of these sequences. The term 
nucleic acid molecule as used herein also comprises fragments which are 
understood to be parts of the nucleic acid molecules that are long enough to 
specifically hybridize to transcripts of at least one of the genes of the appended 
30 tables. These nucleic acid molecules can be used, for example, as probes or 
primers in a diagnostic assay. Preferably, the nucleic acid molecules of the 
present invention have a length of at least 8, 10, 12, 13, 15, 18 in particular of at 
least 20 and particular preferred of at least 25 nucleotides. The nucleic acid 
molecules of the invention or parts therefrom* can also be used, for example, as 
35 primers for a PCR reaction. The fragments used as hybridization probe can be 
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synthetic fragments that were produced by means of conventional synthesis 
methods. 

In a preferred embodiment, the diagnostic composition of the present invention 
comprises at least nucleic acid molecules which are capable of specifically 
5 hybridizing to the mRNAs of at least one of the genes listed in the appended 
tables, preferably 2-5, more preferably 8-12 genes. 

In a more preferred embodiment, the diagnostic composition of the present 
invention comprises at least nucleic acid molecules which are capable of 
specifically hybridizing to the mRNAs of at least one of the genes listed in the 
10 appended tables. In a further preferred embodiment, the diagnostic composition of 
the present invention comprises at least nucleic acid molecules which are capable 
of specifically hybridizing to the mRNAs of all genes listed in the appended tables. 

In a further preferred embodiment, the nucleic acid molecules of the diagnostic 
composition of the present invention are bound to (a) a solid support, for example, 
15 a polystyrene microtiter dish or nitrocellulose membrane or glass surface or (b) to 
non-immobilized particles in solution. 

In an even more preferred embodiment, the nucleic acid molecules of the 
diagnostic composition are present in a microarray format which can be 
established according to well known methods; for details see, e.g., 
20 www.affymetrix.com/technology/tech_spotted.html; 
www.affymetrix.com/technology/tech_probe.html. 

The present invention also provides the use of (a) nucleic acid molecule(s) of the 
present invention for the preparation of a diagnostic composition for the diagnosis 
of a leukemia or for the diagnosis of several subtypes or a disposition to a 

25 leukemia. For the diagnosis of a particular leukemia subtype, preferably, at least 5 
different nucleic acid molecules are used as probes. For diagnosis, preferably, 
bone marrow or peripheral blood can be used. For diagnosis, the target sample is 
contacted with a (a) nucleic acid molecule(s) of the present invention and the 
concentration of individual mRNAs is compared with the mRNA expression profile 

30 levels of a test sample obtained from healthy donors. 

It is a further embodiment of the invention to provide a method of determining 
whether a patient sample contains leukemia cells or other cells and at the same 
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time determining the type and subtype of leukemia cells, comprising the steps of 
providing a patient sample, isolating RNA from the patient sample, transcribing the 
RNA into cDNA and transcribing the cDNA into cRNA while simultaneously 
labelling the cRNA, hybridising the cRNA to a microarray, and determining the 
5 level of expression of a marker or a group of markers. 

Further, the invention contemplates the use of a marker or a group of markers for 
determining whether a patient sample contains leukemia cells or other cells and 
whereby preferably the type and subtype of leukemia cells is simultaneously or 
subsequently is determined. The markers specified in the appended examples and 
10 tables may, in accordance with the invention, be used to differentiate, for example, 
between ALL, CLL, CML and AML. 

The nucleic acid molecule is typically a nucleic acid probe for hybridization or a 
primer for PCR. The person skilled in the art is in a position to design suitable 
nucleic acids probes based on the information provided in in the appended tables. 

The target cellular component, i.e. mRNA e.g., in bone marrow or blood (BM) may 
be detected directly in situ, e.g. by in situ hybridization or it may be isolated from 
other cell components by common methods known to those skilled in the art 
before contacting with a probe. Detection' methods include Northern blot analysis, 
RNase protection, in situ methods, e.g. in situ hybridization, in vitro amplification 
methods (PCR, LCR, QRNA replicase or RNA-transcription/amplification (TAS, 
3SR), reverse dot blot disclosed in EP 0 237 362)) and other detection assays that 
are known to those skilled in the art. Preferably, detection is based on a 
microarray. 

Amplification methods include the polymerase chain reaction (PCR) which 
25 specifically amplifies target sequences to detectable amounts. Other possible 
amplification reactions are the ligase Chain Reaction (LCR, Wu and Wallace, 
1989, Genomics 4:560-569 and Barany, 1991, Proc. Natl. Acad. Sci. USA 88:189- 
193); Polymerase Ligase Chain Reaction (Barany, 1991, PCR Methods and 
Applic. 1:5-16); Gap-LCR (PCT Patent Publication No. WO 90/01069); Repair 
30 Chain Reaction (European Patent Publication No. 439,182 A2), 3SR (Kwoh et al., 
1989, Proc. Natl. Acad. Sci. USA 86:1173-1177; Guatelli et al., 1990, Proc. Natl. 
Acad. Sci. USA 87:1874-1878; PCT Patent Publication No. WO 92/0880A), and 
NASBA (U.S. Pat. No. 5,130,238). Further, there are strand displacement 
amplification (SDA), transciption mediated amplification (TMA), and QD- 



15 
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amplification (for a review see e.g. Whelen and Persing (1996). Annu. Rev. 
Microbiol. 50, 349-373; Abramson and Myers, 1993, Current Opinion in 
Biotechnology 4:41 -47). 

Products obtained by in vitro amplification can be detected according to 
5 established methods, e.g. by separating the products on agarose gels and by 
subsequent staining with ethidium bromide. Alternatively, the amplified products 
can be detected by using labeled primers for amplification or labeled dNTPs. 

The probes can be detectably labeled, for example, with a radioisotope, a 
bioluminescent compound, a chemiluminescent compound, a fluorescent 
10 compound, a metal chelate, biotin or an enzyme. 

The invention further contemplates a method of making an isolated hybridoma 
which produces an antibody useful for assessing whether a patient is afflicted with 
leukemia, the method comprising isolating a protein corresponding to a marker 
selected from the group consisting of the markers listed in Tables 1 to 20, tables 

15 25 or 27 or tables 29, 30, 32, 33, 35, 36, 38, 39, 41, 42 immunizing a mammal 
using the isolated protein, or a peptide corresponding to its sequence or a part 
thereof; isolating splenocytes from the immunized mammal-, fusing the isolated 
splenocytes with an immortalized cell line to form hybridomas; and screening 
individual hybridomas for production of an antibody which specifically binds with 

20 the protein to isolate the hybridoma. Further, an antibody produced by this method 
is contemplated by the invention. The antibody may be fragmented or derivated to 
obtained fragment or derivatives retaining the antibody specificity as has been 
described herein above. 

25 The invention further contemplates a method of assessing the leukemia cell 
carcinogenic potential of a test compound, the method comprising maintaining 
separate aliquots of leukemia cells in the presence and absence of the test 
compound; and comparing expression of a marker in each of the aliquots, wherein 
a significantly altered level of expression of the marker in the aliquot maintained in 

30 the presence of the test compound, relative to the aliquot maintained in the 
absence of the test compound, is an indication that the test compound possesses 
human breast cell carcinogenic potential wherein a marker according to the 
invention is used. 
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The invention further contemplates a system for identifying selected polynucleotide 
records that identify a leukemia cell, the system comprising:a digital computer-, a 
database coupled to the computer; a database coupled to the database server 
"having data stored in, the data comprising records of data comprising a 
5 polynucleotide corresponding to a marker according to the invention and a code 
mechanism for applying queries based upon a desired selection criteria to the data 
file in the database to produce reports of polynucleotide records which match the 
desired selection criteria. 

10 The invention also relates to a method for detecting a leukemia cell, using a 
computer having a processor, memory, display, and input/output devices, the 
method comprising the steps of 

a) providing a sequence of a polynucleotide isolated from a sample suspected 

of containing a leukemia cell, 
15 b) providing a database comprising records of data comprising a polynucleotide 

corresponding to a group of markers according to the invention; 

c) using a code mechanism for applying queries based upon a desired selection 

criteria to the data file in the database to produce reports of polynucleotide records 

of step a) which provide a match of the desired selection criteria of the sequences 
20 in the database of step b), the presence of a match being a positive indication that 

the polynucleotide of step 1) has been isolated from a cell that is a-leukemia cell. 

Also, the present invention relates to a method for assessing the leukemia cell 
carcinogenic potential of a test compound, comprising (a) contacting a non- 
25 leukemia cell with a test compound, and (b) assessing an increase or decrease of 
marker expression in said non-leukemia cell wherein the marker is selected from 
the tables 1 to 20, 25 or 27, 29, 30, 32, 33, 35, 36, 38, 39, 41 or 42. 

The assessment may be effected on the nucleic acid level such as by hybridization 
techniques or PCR or on the protein level such as by using antibody or aptamers 
30 based technologies. 

Finally, the invention relates to a diagnostic composition comprising at least one 
nucleic acid molecule which is capable of specifically hybridizing to the mRNA 
corresponding to the marker gene of any of the appended tables. The nucleic acid 
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molecule may be an antisense DNA or RNA an RNAi molecule a siRNA molecule 
or the like inhibitory molecule capable of specifically blocking trnascription and/or 
translation and/or modification and/or localization of the RNA and/or protein 
corresponding to the marker gene. 

5 

The nucleic acid may also be a sense-strand nucleic acid e.g. RNA or preferably 
DNA which is capable of expressing the protein product of the marker gene, or a 
protein product of substantially similar activity, in a target cell into which it is 
introduced. 

10 The invention further comprises pharmaceutical compositions comprising a 
compound capable of specifically binding to a protein or RNA corresponding to a 
marker of the invention as listed in any of the appended tables. The marker is 
preferably selected from the markers designated as particular preferred markers 
as described herein above . The compound is preferably a compound capable of 

15 inhibiting or increasing the function of the protein or of enhancing or decreasing 
translation of the RNA. The compound is preferably selected from aptameres, 
aptazynes, RNAzynes, antibodies, affybodies, trinextins, anticalins, or the like 
compounds. The effect of the compounds on the RNA may be tested by assaying 
for increased/decreased synthesis of the corresponding protein. The effect of the 

20 compounds on the protein may be assayed the testing the effect of the compound 
in an assay of the proteins function, t which e.g. may be an anzymathic function. 
Alternatively, the effect may be tested by contacting a leukemic cell that expresses 
large amounts of such protein with the compound and assay cellular parameters 
associated with the leukemic state of the cell, such as cell growth, growth factor 

25 dependency and/or differentiation state of the cell. 

In a further embodiment, the invention provides a method of determining wether a 
patient sample contains leukemia cells or other cells comprising the steps of 

a) determining the expression profile of a group of markers in a patient sample 
and 

30 b) concluding from the expression profile whether the patient sample contains 
leukemia cells or other cells, and optionally, to which subtype said leukemia 
cells belong, wherein 
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a subtype or a type of leukemia listed in table 28 b or c is identified, and a 
sensitivity and/or specificity of at least 80, 85, 88, 90, 92,' 95, 97, 98, 99, 99.1, 
99.2, 99.3, 99.4 or 99.5% is achieved, preferably using at least one marker of 
the group of markers listed in table 29 and/or 30. 

5 

In a further embodiment, the invention provides a method of determining wether a 
patient sample contains leukemia cells or other cells comprising the steps of 

(a) determining the expression profile of a group of markers in a patient sample 
and 

1 0 (b) concluding from the expression profile whether the patient sample contains 
leukemia cells or other cells, and optionally, to which subtype said leukemia 
cells belong, wherein 

a subtype or a type' of leukemia listed in tabte 31 b or c is identified, and a 
sensitivity and/or specificity of at least 80, 85, 88, 90, 92, 95, 97, 98, 99, 99.1, 
15 99.2, 99.3, 99.4 or 99.5% is achieved, preferably using at least one marker of 
the group of markers listed in table 32 and/or 33. 



In a further embodiment, the invention provides a method of determining wether a 
patient sample contains leukemia cells or other cells comprising the steps of 

20 (a) determining the expression profile of a group of markers in a patient sample 
and 

(b) concluding from the expression profile whether the patient sample contains 
leukemia cells or other cells, and optionally, to which subtype said leukemia 
cells belong, wherein 

25 a subtype or a type of leukemia listed in table 34 b or c is identified, and a 
sensitivity and/or specificity of at least 80, 85, 88, 90, 92, 95, 97, 98, 99, 99.1, 
99.2, 99.3, 99.4 or 99.5% is achieved, preferably using at least one marker of 
the group of markers listed in table 35 and/or 36. 
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In a further embodiment, the invention provides a method of determining wether a 
patient sample contains leukemia cells or other cells comprising the steps of 

(a) -determining the expression profile of a group of markers in a patient 
5 sample and 

(b) concluding from the expression profile whether the patient sample contains 
leukemia cells or other cells, and optionally, to which subtype said leukemia 
cells belong, wherein 

a subtype or a type of leukemia listed in table 37 b or c is identified, and a 
10 sensitivity and/or specificity of at least 80, 85, 88, 90, 92, 95, 97, 98, 99, 99.1, 
99,2, 99.3, 99.4 or 99.5% is achieved, preferably using at least one marker of 
the group of markers listed in table 38 and/or 39. 



In a further embodiment, the invention provides a method of determining wether a 
15 patient sample contains leukemia cells or other cells comprising the steps of 

(a) determining the expression profile of a group of markers in a patient sample 
and 

(b) concluding from the expression profile whether the patient sample contains 
leukemia cells or other cells, and optionally, to which subtype said leukemia 

20 cells belong, wherein 

a subtype or a type of leukemia listed in table 40 b or c is identified, and a 
sensitivity and/or specificity of at least 80, 85, 88, 90, 92, 95, 97, 98, 99, 99.1, 
99.2, 99.3, 99.4 or 99.5% is achieved, preferably using at least one marker of 
the group of markers listed in table 41 and/or 42. 

25 
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Description of the Figures 



Figure 1a: 


Principal Component Analysis 


Figure 1 b: 


Hierarchical Cluster Analysis 


Figure 2: 


Classification Accuracy 


Figures 3a, 


PCA of AML data based on 312 genes 


3b1 , 3b2: 


Decision Trees according to l(E) 


Figure 4: 


Pair-wise Comparison of Normal BM and AML 


Figure 5a: 


Principal Component Analysis 


Figure 5b: 


Hierarchical Cluster Analysis 


Figure 5c: 


Pair-wise Comparison of Normal BM and ALL 


Figure 6a: 


Principal Component Analysis 


Figure 6b: 


Hierarchical Cluster Analysis 


Figure 6c: 


Pair-wise Comparison of Normal BM and CML 


Figure 7a: 


Principal Component Analysis 


Figure 7b 


Hierarchical Cluster Analysis 


Figure 7c: 


Pair-wise Comparison of Normal BM and CLL 


Figure 8a: 


Principal Component Analysis 


Figure 8b: 


Hierarchical Cluster Analysis 


Figure 8c: 


AML-WHO Classification 


Figure 9a: 


Principal Component Analysis 


Figure 9b: 


Hierarchical Cluster Analysis 


rigure yc. 


comparison ot iNormai dm versus LeuKemia 


Figure 1 0a: 


Principal Component Analysis 


Figure 10b: 


Hierarchical Cluster Analysis 


Figure 10c: 




Figure 11a 


Accurate diagnosis of leukemia is accomplished in a two-step 
approach. First, samples are assigned to one of the major 
leukemia types or normal BM, respectively. Then, if positive for 
ALL or AML, further subclassification based on cytogenetically 
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defined characteristics is proposed. In total 111 samples were 
analyzed by gene expression profiling and implemented in the 
development of different class prediction models: normal BM 
(n=8)r CLlr(n=8), CML (n=10), ALL (n=18), and AML (n=59). 18 
ALL samples can further be characterized by B-lineage ALL 
samples positive for t(8;14) (n=3), t(9;22) (n=7), or t(11q23)/MLL 
(n=4) and T-lineage ALL (n=3), respectively. Additionally, one B- 
ALL sample showed an aberrant karyotype. 59 AML samples 
were comprized of normal karyotype (n=3), complex aberrant 
karyotype (n=4), trisomy 8 as sole abnormality (n=3), t(8;21) 
(n=9), t(15;17) (n=16), inv(16) (n=10), and t(11q23)/MLL (n=10). 
The latter four AML entities were additionally represented by 
each of the following t(8;21),+8 (n=1), t(15;17),+8 (n=2), and 
inv(16),+8 (n=1). Furthermore, some expression profiles were 

pypliidprl for dpv/plonmpnt of thp pla^^ifipr hut ftuhspnupntlv 

CAvlUVJCVi 1 \J 1 VIC VCIUUI 1 1CI 11 VI 11 IC v/ICtOOIIIwl U Li I OUWOV/V^U V/l Illy 

tested for performance in diagnostical class assignments: normal 
BM (n=1), CLL (n=2), CML (n=2), ALL with t(4;11) (n=1), and 
AML with t(1 5; 17) (n=2), respectively. 


Figure 11b: 


Hierarchical clustering of 55 AML samples (rows) versus 25 
informative genes (columns). In total, 15 comparisons within the 5 
groups were performed (pairwise and one-versus-all). Genes 
were selected for maximal accuracy and confidence based on a 
modified signal-to-noise (S2N) algorithm. The scaled gene 
expression levels are coded by intensity and shown on a scale 
from black (no expression) to bright red (highest expression). The 
AML subgroups 'other' (n=10), t(11q23)/MLL (n=10), inv(16) 
/n=10^ (n=9) and tM5"17^ /n=16^ are colored accordina 
to their chromosomal aberrations. The minimal set of informative 
genes is given by HGNC approved symbols (not yet approved 
genes are marked by asterisks). 


Figure 11c 


Hierarchical clustering of 17 ALL samples (rows) versus 19 
informative genes (columns). In total, 10 pairwise or OVA 
comparisons within the 4 groups were performed. Genes were 
selected for maximal accuracy and confidence based on a 
modified S2N algorithm. The scaled gene expression levels are 
coded by intensity and shown on a scale from black (no 
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expression) to bright red (highest expression). The ALL 
subgroups t(1 1 q23)/MLL (n=4), t(9;22) (n=7), t(8;14) (n=3), and 
T-AI I /n— 3\ arp nnlnrpH aonorHinn to thoir nhflrppff^riQtipal 

i r"vi— i— \ i i — w J c*i w wiui a^uui un ly iu u loll Idi cioici lOLlOdl 

chromosomal aberrations or immunophenotype. The minimal set 
of informative genes is given by HGNC approved symbols 
(asterisks mark not yet approved genes). 


Figure 12a - 
12i 


Bar graphs of gene expression intensities for distinct leukemia 
types and subtypes. A short description indicates the respective 
classes which can be distinguished at each case. 


Figure 13a. 


Dot plot of expression levels for a particular gene in two groups 
(e.g. group1= normal samples, group2 = disease samples). 
Golub's decision limit to distinguish between groupl and group2, 
which is defined as the mean of //1 and jj 2 (j/ a : mean expression 
in group a), is not optimal, because the standard deviations of 

npnp PYnrPQQinn lox/olc \A/ith 1 n tho t\A/Pi rirni I r\c aro won/ Hiff cvrorvt 1 r\ 
yd ic ca[ji cooiui i icvcio vviii iu i ii ic iwu yiLJU [Jo ait/ vciy unit?! t?i u. ill 

this case, a lower limit (e.g. maximum level in groupl) would 
have been more appropriate to separate the two groups by 
means of gene expression levels. 


Figure 13b 


Accuranv anri nnnfirlpnpp for p*ll-n£jir<? ?*nrl rinp-x/prQiiQ-pll 

nuuuia^y c*i iu uuiiiiudivc iwi cut yjdi I o di iu \JltXS vgioUo cxw 

comparisons in a dataset consisting of 103 samples from 5 
classes (A,B,C,D,E) using Golub's method and diffgenes. Both 
accuracy and confidence are higher with diffgenes. 


Figure 14 


Detailed characteristics of the 37 AML cases representing three 
defined cytogenetic aberrations corresponding to four 
cytomorphological subtypes according to FAB classification: 
inv(16)(p13q22)/AML M4eo, t(8;21)(q22;q22)/AML M2, and 
t(15;17)(q22;q12)/AML M3 or M3v. Diagnosis was proven by a) 
karyotype analysis, b) interphase-FISH (CBFB, AML1 and ETO, 
PML and RARA), c) RT-PCR (CBFB-MYH1 7, AML 1-ETO, PML- 
RARA), and d) cytomorphology. 


Figure 15 


Figure 15: Three cytogenetically defined AML subtypes with 
t(15;17), t(8;21) or inv(16) can be separated based on their gene 
expression profiles of 1,000 preselected genes. The three 
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different subgroups form distinct clusters. For visualization in a 
two-dimensional plot the first two principal components were 
chosen as they captured most of the variation in the original data 
set. The subgroups are coloured according to their chromosomal 
aberrations, respectively 


Figure 16 


Hierarchical cluster analysis of the gene expression pattern of the 
set of 13 predictor genes as identified according to the adapted 
class prediction methodology introduced by Golub et al. The 
three distinct cytogenetic AML subgroups can clearly be 
separated based on their gene expression profiles. Each row 
represents a leukemia sample and each column a gene. The 
gene accession numbers are shown on the top. Varying 
expression levels are shown on a scale from black (no gene 
expression) to bright red (highest expression). The subgroups are 
coloured according to their chromosomal aberrations, 
respectively. 


Figure 17 


Schematic representation of the 15 decision trees (a to o) used in 
the multiple-tree classifier. Arrows indicate high (arrow up) or low 
(arrow down) expression, "0" and "+" denote absence or 
presence of a gene, respectively (e.g., in (a) the low expression 
of X96719 indicates AML with t(15;17) whereas the high 
expression of X96719 indicates AML with inv(16) or AML with 
t(8;21); the latter two entities are distinguished by X53742: lack of 
expression identifies AML with inv(16) and positive expression 
predicts AML with t(8;21)). The GenBank accession numbers are 
given for genes the expression level of which are used for 
decision. Nodes are represented as ovals and leaves as 
rectangles. Classes are referred to as t(15;17), t(8;21) or inv(16). 


Figure 18 


Based on a preselection of 82 genes morphologically different but 
oyiogeneiicaiiy identical mivil suoiypes Mo witn i( 10, 1 /) ano Mov 
with t(15;17) can be separated based on their gene expression 
profile. AML M3 samples are shown as green dots, AML M3v 
samples as blue dots, respectively. 
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Figure 19: 


Correlations between protein expression levels and mRNA 
abundance. Expression levels were compared by Pearson's 
correlation. Mean fluorescence intensity values obtained by flow 
cytometry were calculated for all events with fluorescence values 
nigner man isotype controls using the oeiicjuest Hro software 
(Beckton Dickinson). Average fluorescence intensity values 
obtained by micorarray analyses were calculated by the 
Affymetrix software, Microarray Suite, Version 4.0.1. 


rigure 20 


Detailed characteristics ot the 45 AML cases representing tour 
defined recurrent cytogenetic abnormalities. Diagnosis was 
proven oy a; Karyo-iype analysis, D; interpnase-rion, c; hi- 
PCR, and d) cytomorphology. 


Fig. 21 


Class separation by principal component analysis (PCA) 


Fig. 22 


Figure 3: PCA-Plot based on 39 informative genes. All leukemia 
samples could accurately be assigned to their corresponding 
cytogenetic subtype with 100% accuracies. To illustrate these 
results, a hierarchical clustering is shown (Fig. 4). 


Fig. 23 


Hierarchical clustering of 44 diagnostic AML samples and 8 
normal BM samples (columns) versus 39 informative genes 
(rows). Gene expression levels are coded by intensity and 
represented on a scale from black (no expression) to bright red 
(highest expression). 


Fig. 24 to 465 


Bar graphs of gene expression intensities for distinct leukemia 
types and subtypes or normal bone marrow, respectively, 
ofcjicoitju bidiibiiuciuy oiy nil iGcuii genes are given oy MTiyrneinx 
identification number and Human Gene Nomenclature Committee 
approved symbol (where available). A short description indicates 
the respective classes which can be distinguished at each case. 
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The following examples, references, sequence listing and figures are provided to 
aid the understanding of the present invention, the true scope of which is set forth 
in the appended claims. It is understood that modifications can be made in the 
5 procedures set forth without departing from the spirit of the invention. 



Examples 

EXAMPLE 1 

EXAMPLE 1 - General Methods 

10 EXAMPLE 1 - (A) Selection and characterisation of Leukemia Samples 

Bone marrow (BM) aspirates were taken at the time of the initial diagnostic biopsy 
and remaining material was immediately lysed in RLT buffer (Qiagen), frozen and 
stored at -80 C until preparation for gene expression analysis. For microarray 
analysis the GeneChip System (Affymetrix, Santa Clara, CA, USA) was used. The 

15 targets for GeneChip analysis were prepared according to the current Expression 
Analysis. Briefly, frozen lysates of the leukemia samples were thawed, 
homogenized (QIAshredder, Qiagen) and total RNA extracted (RNeasy Mini Kit, 
Qiagen). Normally 10 ug total RNA isolated from 1 x 107 cells was used as starting 
material in the subsequent cDNA-Synthesis using Oligo-dT-T7-Promotor Primer 

20 (cDNA synthesis Kit, Roche Molecular Biochemicals). The cDNA was purified by 
phenol-chlorophorm extraction and precipitated with 100% Ethanol over night. For 
detection of the hybridized target nucleic acid biotin-labeled ribonucleotides were 
incorporated during the in vitro transcription reaction (Enzo® BioArray™ 
HighYield™ RNA Transcript Labeling Kit, ENZO). After quantification of the 

25 purified cRNA (RNeasy Mini Kit, Qiagen), 15 ug were fragmented by alkaline 
treatment (200 mM Tris-acetate, pH 8.2, 500 mM potassium acetate, 150 mM 
magnesium acetate) and added to the hybridization cocktail sufficient for 5 
hybridizations on standard GeneChip microarrays. Before expression profiling 
Test3 Probe Arrays (Affymetrix) were chosen for monitoring of the integrity of the 
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cRNA. Only labeled cRNA-cocktails which showed a ratio of the messured 
intensity of the 3' to the 5' end of the GAPDH gene less than 3.0 were selected for 
subsequent hybridization on HG-U95Av2 probe arrays (Affymetrix). Washing and 
staining the Probe arrays was performed as described (siehe Affymetrix-Original- 
5 Literatur (LOCKHART und LIPSHUTZ). The Affymetrix software (Microarray Suite, 
Version 4.0.1) extracted fluorescence intensities from each element on the arrays 
as detected by confocal laser scanning according to the manufacturers 
recommendations. 

10 EXAMPLE 1 - (B) Data analysis 

Class separation by principal component analysis and hierarchical cluster 
analysis: In a first step we reduced the dimensionality of the number of genes. 
Therefore we scaled the data from each array to a target intensity value 50 
(Affymetrix Microarray Suite) in order to be able to perform inter-array 

15 comparisons. Then all data was analyzed using Significance Analysis of 
Microarrays (Multiclass Response, Stanford University) and we selected a distinct 
number of genes based on a permutations test. This reduced set of genes which 
showed to be significant then was analyzed using the public available Java 
application J-Express analysis tool (download at www.molmine.com). Principal 

20 Component Analysis and Hierarchical Cluster Analysis (parameters Cluster 
method: single linkage and Distance metric: euclidean) showed a clear separation 
of analyzed groups of samples e.g. healthy bone marrow versus leukemia. 



EXAMPLE 1 - (C) Identification of differentially expressed genes according to 
25 Golub et al. (Science 1999 Oct 15;286(5439):531-7) 

A previously described (Science 1999 Oct 15;286(5439):531-7) was modified to 
reduce the number of candidate genes that could distinguish between our 
leukemic samples of interest. In a first step the raw data was scaled using 
Affymetrix software (target intensity 50 for all genes). To avoid division by zero or 
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negative numbers as occuring due to the current expression algorithm (Affymetrix) 
we set all average intensities of 20 or less to 20. Briefly, for a more detailed gene 
expression profiling we applied the data analysis method according to Golub et al. 
using weighted voting. In a first step gene expression levels were log-transformed 
5 with a cut-off value set at 20 units. To assess the significance of selected genes 
we performed a leave-one-out cross-validation. Only those genes were considered 
important which were contained in all cross validation classificators. To determine 
the association between genes by chance we performed a permutation test (100 
cycles). Because the number of informative genes, which are able to discriminate 
10 between samples, is unknown, we applied the Golub method for different numbers 
of informative genes (range: 10-200). The minimal set of genes which provided 
optimal classification accuracy was selected to avoid overfitting. 



EXAMPLE 2 

15 EXAMPLE 2 - Identification of genes, the aberrant expression of which is 
associated with a particular leukemia subtype 

Monitoring the gene expression level of thousands of mRNA transcripts 
simultaneously in one experiment is the key technology to find out the specific 
genes which allow the subsequent development of a class prediction model. We 
20 therefore used the Affymetrix oligonucleotide microarray technology (GeneChip® 
Instrument System) to obtain gene expression profiles of each individual clinical 
sample of interest. The HG-U95Av2 probe arrays gave us information about the 
relative mRNA abundance of about 12,000 full length human genes which are 
represented on these high-density oligonucleotide microarrays. 

25 In total, 8 bone marrow samples of healthy volunteers and leukemia patients were 
investigated. Five different types of bioinformatic calculations were performed. 
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EXAMPLE 2 (1) Three distinct genetic subtypes of AML 

Three defined cytogenetic aberrations t(8;21)(q22;q22) (n=9), t(15;17)(q22;q12) 
(n=16) and M4eo with inv(16) (p13q22) (n=10) corresponding to the 4 FAB- 
subtypes AML M2, M3,or M3v and M4eo, respectively. After we obtained bone 

5 marrow aspirates from 35 untreated patients with newly diagnosed AML, all, cases 
were characterized by cytomorphology, cytogenetics and by molecular genetics 
(Fig. 1). AML subtypes M3 and M3v both carry the same chromosomal aberration 
but differ in morphological aspects like nuclear configuration, granulation and 
clinical aspects white blood cell count (WBC), respectively. In all cases, these 

10 balanced abnormalities were confirmed by fluorescence in-situ hybridization. The 
corresponding fusion transcript was detected by RT-PCR and/or quantitative real 
time PGR.- The median age of the patients- was 53-years (range, 19-82 years) and 
did not differ between the respective groups. The median WBC count was 17.0 G/l 
(range, 0.8-168.0 G/l) and was strikingly lower in patients with AML M3 as 

15 compared to all other patients. 



EXAMPLE 2 - Methods used 

EXAMPLE 2 - (A) Selection and characterisation of Leukemia Samples 

We obtained bone marrow (BM) aspirates from 37 AML patients standing for four 
20 morphological and three underlying cytogenetic subgroups that were sent to the 
Laboratory of Leukemia Diagnostics (LFL) for central diagnosis within the German 
AMLCG study (Klinikum Grosshadern, Munich, Germany). They were selected for 
this study on the basis of several criteria. It was mandatory that none of the 
patients had been treated. All samples, exclusively newly diagnosed in our 
25 laboratory, had to be well characterized as de novo AML and diagnosis had been 
proven by cytomorphology, cytogenetics, flow cytometry and molecular genetics in 
every single case. All samples for gene expression analysis were taken at the time 
of the initial diagnostic biopsy when remaining material was immediately lysed in 
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RLT buffer (Qiagen), frozen and stored at -80 C until preparation for gene 
expression analysis. 



EXAMPLE 2 - (B) Microarrav experiments 

5 For microarray analysis the GeneChip System (Affymetrix, Santa Clara, CA, USA) 
was used. The targets for GeneChip analysis were prepared according to the 
current Expression Analysis Technical Manual. Briefly, frozen lysates of the 
leukemia samples were thawed, homogenized (QIAshredder, Qiagen) and total 
RNA extracted (RNeasy Mini Kit, Qiagen). Normally 10 ug total RNA isolated from 

10 1 x 107 cells was used as starting material in the subsequent cDNA-Synthesis 
using Oligo-dT-T7-Promotor Primer (cDNA synthesis Kit, Roche Molecular 
Biochemicals). The cDNA was purified by phenol-chlorophorm extraction and 
precipitated with 100% Ethanol over night. For detection of the hybridized target 
nucleic acid biotin-labeled ribonucleotides were incorporated during the in vitro 

15 transcription reaction (Enzo® BioArray™ HighYield™ RNA Transcript Labeling Kit, 
ENZO). After quantification of the purified cRNA (RNeasy Mini Kit, Qiagen), 15 ug 
were fragmented by alkaline treatment (200 mM Tris-acetate, pH 8.2, 500 mM 
potassium acetate, 150 mM magnesium acetate) and added to the hybridization 
cocktail sufficient for 5 hybridizations on standard GeneChip microarrays. Before 

20 expression profiling Test3 Probe Arrays (Affymetrix) were chosen for monitoring of 
the integrity of the cRNA. Only labeled cRNA-cocktails which showed a ratio of the 
measured intensity of the 3' to the 5' end of the GAPDH gene less than 3 were 
selected for hybridization on HG-U95Av2 probe arrays (Affymetrix). Washing and 
staining the Probe arrays was performed as described. The Affymetrix software 

25 (Microarray Suite, Version 4.0.1) extracted fluorescence intensities from each 
element on the arrays as detected by confocal laser scanning according to the 
manufacturers recommendations. 
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EXAMPLE 2 - (C) Class separation by principal component analysis and 
hierarchical cluster analysis 

In a first step We reduced the dimensionality of the number of genes. Therefore we 
scaled the data from .each array to a target intensity value 50 (Affymetrix 
5 Microarray Suite) in order to be able to perform inter-array comparisons. Tljen all 
data was analyzed using Significance Analysis of Microarrays (Multiclass 
Response, Stanford University) and we selected 580 genes based on a 
permutations test. This reduced set of genes which showed to be significant then 
was analyzed using the public available Java application J-Express analysis tool 
10 (download at www.molmine.com). Principal Component Analysis and Hierarchical 
Cluster Analysis (parameters Cluster method: single linkage and Distance metric: 
euclidean) showed a-clear separation- of -analyzed groups of -samples e.g. healthy 
bone marrow versus leukemia. 

15 EXAMPLE 2 - (D) Identification of differentially expressed genes according to 

This analysis was carried cut as described in Example 1 (C) above. Briefly, 
classification of tumor samples was achieved by using a set of samples whose 

20 class had been already determined. This set was called training set. By using the 
oligonucleotide microarrays (Lockhart, D. J., et al., Nat Biotechnol 14 (1996) 1675- 
80), the, transcript levels in training set samples were measured for those genes 
that were represented on the microarray. The values for "transcription strength" 
were determined by averaging the values of a set of probes which were compared 

25 to a set of nearly identical probes containing a single mismatch. This was 
performed by using; methods provided by the oligonucleotide array of Affymetrix 
Inc. 

EXAMPLE 2 - (E) Principle Components Analysis, Classifier and 

DecisionsTrees 

30 

In order to obtain comparable values between different samples, they had to be 
standardized first. The method followed that described (Lockhart, D. J., et al., Nat 
Biotechnol 14 (1996) 1675-80), except that correcting for (additive) background 
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had been omitted. In brief, the data from one of the samples were declared to 
serve as a "standard", and the values from all other samples were adapted to this 
standard. For every possible comparison to this standard, a set of "reliable" values 
was determined by calculating the correlation coefficient for a series of intervals of 
5 increasing length. The lower bound of reliability was the bound of the interval that 
had a correlation coefficient less than or equal to the smaller intervals. From all 
reliable values, 2 (logarithmized) correction factor was calculated by computing the 
median of the differences of the logarithmic values. Values that were zero or 
negative prior to taking the logarithm were not taken into account. 

10 The obtained data matrix contained values from one sample per column. The gene 
expression profile across all samples for one gene or gene fragment represented 
on the oligonucleotide microarray was contained in a row of the matrix. To allow 
for rapid calculation of the classifier and to reduce memory usage, certain genes 
were pre-selected from the set of all genes represented on the array. The following 

15 criteria were applied: 



Formula (1): 

k k 

Formula (2): 

k 

k 

//1 refers to the average of the Ath class (i=l,...,k), jj to the total average, o\ to the 
20 standard deviation of the Ath class and t to an arbitrary treshold < 1 . Selection by 
these methods resulted typically in a reduction in the number of genes by a factor 
of 10-30. To check the quality of the selection procedure, the first two principal 
components (Jolliffe, Principle Components Analysis (1986), Springer (New York)) 
for the samples were plotted. This allowed to judge whether or not a rigorous 
25 discrimination was possible between the different classes. 
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For construction of the classifier, decision trees (Breiman et al., Classification and 
regression try, Wadsworth & Brooks/Cole (Monterey)) were used. Simple decision 
trees that discriminate between n classes by using only transcription levels for (n-l) 
genes were used. They were trained and the selected genes were the discarded 
5 from the original data set. A new tree was constructed by using the truncated data 
set and the entire procedure was iterated until a predetermined number of trees 
was reached. The optimal number of trees could be estimated by counting the 
number of misclassifications of classifiers built from different numbers of trees. For 
this, an independent data set of cross-validation had to be used. The final vote of 

10 the multi-classifier was obtained by applying a vote-by-majority rule to the 
predictions o f the contained trees. In the example of the present invention 15 
decision trees had been used for the multi-classifier. This allowed perfect 
classification of 100% of the samples, discriminating between classes that were 
given by chromosomal aberrations. To estimate generalization properties, i.e. how 

15 accurate the classifier may perform on samples that have not been used for 
training, cross-validation had been used (Efron and Tibshirani, An introduction to 
the bootstrap (1993), Chapman & Hall (New York, London), pp. 237-247). 

EXAMPLE 2 - Results (Goltib Method) 

20 From this point of view it was found that a set of 1 7 genes was sufficient to 
distinguish distinct AML subtypes from each other with high precision (Tables 1). 
The classification model was able to identify the 4 morphologically and 3 
cytogenetically and molecular biological different subtypes AML with t(8;21), with 
t(15;17), and with inv(l6) (Figures la-b, 2). 

25 In conclusion by comparison of gene expression profiles of AML samples (3 tested 
genetic subtypes t(8;21), t(15;17) and inv(16)) genes could be identified which 
allowed a differentiation between each individual AML subtype in detail could be 
shown for the first time that these distinct abnormalities on the genomic level relate 
to a specific gene expression pattern. In other words, in the experimental setting 

30 the knowledge of the expression status of these designated genes was sufficient 
to predict the genetic abnormality and allows the diagnosis of specific genetically 
defined subtypes of AML (Table 1). 

Results of methods described in l(E) are shown in Table 2 and Figures 3a + b, 1/2 
and 4. 
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EXAMPLE 2 - II) Pair-wise comparisons between normal bone marrow, AML, 
ALL, CML, and CLL: By pair-wise comparisons gene expression profiles of 8 
cases of normal bone marrow, 48 AML, 9 ALL, 8 CML, and 7 CLL were evaluated. 
5 These led to the identification of subtype-specific genes (Tables 3-12. Figs. 5a-c, 
6a-c, 7a-c, 8a-c). 



EXAMPLE 2 - III) AML classified according to WHO proposal 

To allow classification of AML subtypes according to the new WHO proposal we 
10 used the gene expression profiles of four genetically defined AML subtypes 
(t(8;21) n= 9; t(15;17) n= 18; inv(16) n= 10; 11q23/MLL aberrations n= 11). This 
led to the identification of subtype-specific genes (Table 13, Figs. 9a-c). 



EXAMPLE 2 - IV) Normal bone marrow versus distinct genetic subtypes of 
1 5 AML: We used the gene expression profiles of normal bone marrow (n=8) and of 
four genetically defined AML subtypes (t(8;21) n= 9; t(15;17) n= 18; inv(16) n= 10; 
1 1q23/MLL aberrations n= 10). This led to the identification of genes that allow the 
distinction between normal bone marrow and each of the four AML subtypes 
(Table 14). 

20 

EXAMPLE 2 - V) Identification of genes specifically separating normal bone 
marrow, AML, ALL, CML, and CLL: : We used the gene expression profiles of 
normal bone marrow (n=8) and of AML (n=48), ALL (n = 9), CML (n = 8), and CLL 
(n =7). This led to the identification of xx genes that allow the distinction between 
25 normal bone marrow and each of the four leukemia subtypes (Table 15, Figures 
10a-c). 
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Example 3: Gene expression profiling provides a global and 
robust diagnostic tool for leukemia 

Example 3- Introduction 

The expression profiles of 12,600 genes were analyzed in 103 patients suffering 
5 from chronic myeloid leukemia (CML), chronic lymphoid leukemia (CLL), acute 
lymphoblastic leukemia (ALL), and acute myeloid leukemia (AML). A set* of 71 
genes shown in table 16 to 19 was identified as the minimal set necessary to 
accurately diagnose prognostically relevant leukemia subtypes and to distinguish 
these from normal bone marrow (BM, n=8). Thus, microarray technology is a 

10 suitable method for diagnosis of leukemia. 

Today, the classification of hematological malignancies according to the WHO 
criteria describes chronic myeloid leukemia (CML), chronic lymphoid (CLL), acute 
lymphoblastic (ALL), and acute myeloid leukemia (AML). Within the latter two 
several prognostically relevant subtypes are established (see example 4). This 

15 subclassification is based on genetic abnormalities of the leukemic blasts 
associated with different prognoses and becomes increasingly important to guide 
therapy. Thus, the development of new, specific treatment approaches requires 
the precise identification of these subtypes that may benefit from individual 
therapeutic protocols. It has already been shown that the development of drugs 

20 targeting molecular aberrations can dramatically improve outcome. The 
introduction of all-trans retinoic acid (ATRA) into the treatment of AML with 
t(15;17)(q22;q11-12) has improved outcome from about 50% to 80% long-term 
survivors (1). In CML patients imatinib, a designed molecule that inhibits the 
t(9;22)(q34;q11) specific chimeric tyrosine kinase BCR-ABL, induces dramatically 

25 higher response rates as compared to conventional drugs (2). To fully take 
advantage of specific treatment options a precise identification of distinct leukemia 
subtypes is mandatory. However, standard diagnostics of leukemia using a 
combination of complementary methods is expensive, time-consuming, and 
requires experienced specialists. 

30 Since its introduction, microarrays have been promising tools for basic research. 
With regard to leukemia, the pivotal discrimination of unselected ALL and AML 
samples based on their gene expression signatures inspired numerous studies (3). 
Recently, subtypes of childhood ALL could be correlated to specific gene 
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expression profiles leading to. both marker genes suitable for initial diagnostics and 
canditates as predictors for outcome (Yeoh, Eng-Juh. pediadric ALL expression 
profiling Cancer Cell, 2002). Additionally, novel entities in hematological 
malignancies could be identified based on their distinct expression pattern as has 
5 been shown for multiple myeloma, large cell lymphoma, and childhood ALL (4-6). 
In example 4, it is demonstrated that cytogenetically defined AML subtypes can be 
correlated to specific gene expression profiles (see example 4). AML FAB M2 with 
t(8;21)(q22;q22), FAB M3/M3v with t(15;17)(q22;q11-12), or M4eo with 
inv(16)(p13q22) could be classified based on a minimal set of 13 genes. These 
10 genes belong to a large variety of different functional classes including members 
of signaling pathways, cell surface antigens, as well as plasma glycoproteins. 
Several genes are known to be involved in cytoskeletal structure, transcriptional 
processes, or have not yet further been functionally described. 

Here, gene expression profiles of 103 leukemia patients were acquired 

15 representing 11 groups and eight normal BM donors to designate leukemia- 
specific genes which build up the basis for a novel diagnostic tool. Following the 
aims of Golub, who introduced the cancer class prediction methodology (3, 7), all 
four major leukemia types were analyzed and also included cytogenetically 
defined subgroups of AML and ALL as described in the WHO classification of 

20 leukemia, respectively (Fig. 11a). All patient samples were thoroughly 
characterized combining cytomorphology, cytogenetics, immunophenotyping, and 
molecular genetics. This was a prerequisite to obtain disease-specific gene 
expression profiles for each entity. We used Affymetrix expression probe arrays 
HG-U95Av2 to interrogate the mRNA abundance of approximately 12,600 

25 transcripts. In order to identify genes suitable for a leukemia prediction classifier 
we applied a slightly modified prediction methodology as introduced by Golub [see 
(Note1_Golub method)]. A minimal set of candidate genes had to show both 
maximal classification accuracy and maximal confidence. Accuracy of the 
classifiers was determined by permutation-based neighborhood analysis [see 

30 (Note2__ leave-one-out crossvalidation)]. Additional information about the absolute 
differences of expression intensities and further descriptions of all candidate genes 
can be found in the supporting online material. 
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In a first step, based on 23 informative genes the samples were assigned to either 
normal BM, CLL, CML, ALL, or AML, respectively (Table 22; Description of Table 
22: Classification scheme for 4 major leukemia types and normal BM. Matrices 
delineate distribution of actual leukemia types as compared with predicted types 
5 from pairwise comparisons. Class assignment can be based on the expression 
profiles of 23 genes. Except for pairwise comparison of AML versus ALL, all, cases 
can be predicted accurately in leave-one-out cross validation with 100% accuracy 
and strong confidence. For each pairwise comparison the minimal set of 
informative genes is represented by approved HUGO Gene Nomenclature 

10 Committee (HGNC) symbols. Not yet approved genes are marked by asterisks.). 
In 9/10 pairwise comparisons all samples were classified correctly (335 individual 
assignments; 100% accuracy). In one comparison (AML versus ALL) 75/77 
samples were classified correctly resulting in an accuracy of 97%. Two ALL 
samples were misclassified as AML. This may be due to the heterogenity of both 

15 groups (n=18 versus n=59) causing noise in the expression data. 

For each pairwise comparison a set of discriminative genes is disclosed in table 20 
whereby the gene names can be found in table 21 . The most discriminative and 
informative genes are marked by asterisks in table 20 and are the 71 genes shown 
in table 16 to 19 

20 In detail, we found phospholipidscramblase 1 (PLSCR1) to be lower expressed in 
AML and ALL as compared to normal BM. PLSCR1 encodes for a plasma 
membrane protein and has been proposed to play a role in transbilayer migration 
of phospholipids and in recognition and phagocytic clearance of injured, aged, or 
apoptotic cells (8). The biologic effects of interferon-alpha may be mediated by 

25 PLSCR1 which is markedly upregulated by interferon (9, 10). We also observed 
that LEF-1 was absent in myeloid leukemias but highly expressed in lymphoid 
leukemias. LEF-1 was shown to be mitogenic and important for cell survival in pro- 
B cells (11). The B-cell specific coactivator of octamer binding transcription factors, 
POU2AF1, plays an important role in the antigen-driven stages of B cell activation 

30 and maturation and discriminates between AML and CLL (12). MSF has been 
reported to be a translocation partner of the mixed-lineage leukemia gene (MLL) in 
AML and was able to separate AML from ALL (13). Likewise, OS-9, not yet further 

< 
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functionally described except for amplification in osteosarcomas, was differentially 
expressed between AML and ALL (14). HLA-DMB plays a critical role in antigen 
presentation by catalyzing the release of class II HLA-associated invariant chain 
binding sites for acquisition of antigenic peptides (15). It is known that lymphocytes 
5 in CLL express high levels of class II antigens whereas mature myeloid leukemias 
are e.g. HLA-DR negative (16, 17). Therefore, the differential expression of HLA- 
DMB in CML as compared to CLL illustrates well the differential expression of cell 
surface MHC class II molecules. NCOA1 plays a critical role in STAT3 and STAT6 
pathways and was expressed higher in CLL as compared to ALL suggesting an 

10 inhibitory effect of STAT6-mediated transactivation in CLL (18). A member of the 
tumor necrosis factor receptor family, whose surface expression has already been 
reported in CLL (19), CD27, was identified to assign samples either ALL or CLL. 
- - We also detected LCN2~ that was shown to be a modulator of inflammation 
regulated by interleukin-9 with highest expression in CML samples (20). IRF4, an 

15 immune system-restricted interferon regulatory factor that is required for 
lymphocyte activation showed no expression in CML while it was expressed in 
normal BM. Recently, an increase of IRF4 levels in CML patients demonstrated an 
association with a good response to interferon-alpha therapy (21). Several other 
proteins (DEFA3, SGP28, CAMP, CLC) are known to be stored in the granules of 

20 neutrophils and allowed assignment of leukemic samples to the CML type if highly 
expressed (22-25). 

The second step of our approach was to build up a classifier for the identification 
of AML subtypes genetically defined according to the WHO classification, i.e. AML 
with t(8;21), with t(15;17) with inv(16), and with 1 1 q23-translocations involving the 

25 MLL gene, respectively. In addition, a category 'other' was analyzed comprizing 
AML with normal karyotype (n=3), AML with complex aberrant karyotype (n=4), 
and AML with trisomy 8 as sole abnormality (n=3), respectively. A set of 25 most 
informative genes was identified based on pairwise comparisons and one-versus- 
all (OVA) comparisons. None of these genes had already been identified for the 

30 classification of the four leukemia types and normal BM as described above. As 
shown in Figure 11b, distinct AML subgroups cluster together due to 
homogeneous expression profiles. This classification model showed 100% 
classification accuracies in 14/15 comparisons (440 individual assignments). In 
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one OVA comparison, 'other 1 versus all other AML, 54/55 samples were assigned 
correctly. The missclassification of one sample may also reflect the large 
heterogenity of both groups. 

The following genes were identified in OVA comparisons and discriminate distinct 
5 AML subtypes. The gene most valuable for prediction of AML M4eo with inv(16) 
was MYH11, Its higher expression as compared to all other AML most probably is 
due to hybridization of the M4eo-specific fusion transcripts CBF3-MYH11 to 
corresponding A/7V7-/7 7-oligonucleotides represented on the microarray (26). 
Likewise, the higher expression of CBFA2T1 (formerly ETO) in AML with t(8;21) 

10 may be due to a similar effect of hybridization of the subtype-specific AML1-ETO 
fusion transcript (27). Another highly characteristic gene for t(8;21) positive AML 
— was-POL/4^7, -which -has- been- described to play an important role in retinal 
ganglion cell differentiation and has recently been shown to confer an oncogenic 
potential when co-transfected with H-RAS (28). Furthermore, it was shown to be 

15 highly expressed in neuro-epithelioma and ewing sarcomas (29). Another member 
of this transcription factor family, POU2F2, was able to discriminate between 
t(11q23)/MLL versus group 'other'. A related gene, POU2AF1, has recently been 
reported to be underexpressed in acute leukemia with t(11q23)/MLL- 
rearrangement (5). The most informative genes in our approach discriminating 

20 AML with t(1 1q23)/MLL-rearrangement from all other AML subtypes were SOCS-2 
and MBNL. Generally, SOCS-2 shows a higher expression level in AML with 
t(11q23)/MLL-rearrangement and is known to play a role in cytokine-induced 
signaling pathways (30). Similarly, MBNL shows a higher expression in AML with 
t(11q23)/MLL-rearrangement as compared to all other AML samples. Its encoded 

25 protein as well as other MBL family members are localized in the nucleus and 
share a Cys3His zinc finger motif (31). MBL proteins occur in several isoforms due 
to alternative splicing (32) and may have different functions as has been shown for 
HOX genes (33). HOXA9 has been reported to be highly expressed in leukemia 
with MLL-rearrangements (5). In contrast, expression of HOXB5 \s characteristic of 

30 AML group 'other' as compared to all other AML subtypes in our data. The most 
important genes discriminating AML with t(15;17) from all other AML subtypes 
were ARGHGAP4 and CTSW. ARGHGAP4 is predominantly expressed in 
hematopoietic cells but showed a lower expression level in AML with t(15;17) as 
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compared to all other AML subtypes. It encodes a member of signaling proteins 
involved in regulation of small GTP-binding proteins of the RAS-superfamily, which 
themselves play an important role in cell cycle and apoptosis (34). CTSW encodes 
for a recently described papain-like cysteine protease, which is predominantly 
5 expressed in NK cells and to a lesser extent in cytotoxic lymphocytes. It may 
represent a putative component of the endoplasmatic reticulum resident proteolytic 
machinery (35). A survey about the expression levels of genes in the AML 
subtypes can be found in Fig. 12a-d 

Subclassification of ALL comprizing the three B-lineage groups ALL with t(9;22), 
10 with t(4;11), or with t(8;14) was analyzed next and compared with T-lineage ALL 
expression profiles. All samples were classified correctly on the basis of 19 genes 
_ (FigriTc)."TrTis "set" included " TRS, which was already described to-distinguish 
between CLL and CML (Table 22). 

In detail, the genes encoding for the T cell receptor beta subunit and T cell surface 

15 CD3 delta chain (TRB, CD3D) were identified as' highly indicative of T-ALL as 
compared to both ALL with t(9;22) and all other ALL subtypes. This is in line with 
standard diagnostics of T-ALL by immunophenotyping where these antigens 
comprize the most specific ones (36). Similarly, MME (formerly CD10) was highly 
expressed in ALL with t(9;22) only. This on the one hand may reflect that t(9;22) is 

20 observed in common-ALL and in pre-B ALL only. On the other hand, this data 
again demonstrates that the gene used for diagnostic purposes in flow cytometry, 
MME, may be highly indicative of these ALL subtypes in comparisons to the more 
immature B-lineage ALL, i.e. pro-B ALL, as well as the mature B-ALL and the T- 
ALL. Furthermore, the identification of connective tissue growth factor (CTGF) as a 

25 specific marker for ALL with t(4;11) adds to previous data demonstrating its 
increased gene expression in childhood ALL in general (37). The glucocorticoid 
receptor beta has been shown to be highly expressed in ALL with t(4;1 1) but not in 
t(9;22) positive ALL. This is in line with the particularly poor prognosis of the latter 
subtype since response to corticoid therapy is one of the most powerful prognostic 

30 factors in ALL (38, 39). In addition, we speculate that new treatment options may 
be realized for T-ALL based on the high expression of adenosine deaminase 
(ADA) in this subtype. Inhibitors of ADA have been shown to be effective in 
indolent T-cell lymphomas but have not yet been evaluated in T-ALL (40). One 
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cytokine differentially expressed between t(8;14) positive ALL and T-lineage ALL 
was SCYA3. We recommend testing the monitoring of its protein expression as a 
supplemental antigen useful for immunophenotypical identification of t(8;14) 
positive ALL. Finally, in ALL carrying t(4;11) v-myb is highly expressed and may 
5 thus be involved in the pathogenesis of this subtype. In general, a role of v-myb 
has been described for the transformation of myelomonocytic cells (41). A survey 
about the expression levels of genes in the AML subtypes can be found in Fig. 
12e-12i. 

At least, we intended to separate t(9;22) positive from t(9;22) negative ALL. Our 
10 data contained two genes encoding for ADCY3 and the hypothetical protein 

KIAA 1013 which were sufficient for the 100% correct assignments of 18 analyzed 

cases. Both genes showed a higher expression in t(9;22) positive as compared to 
-t(9;22)~ne"gative~ALL~Additionallyr distinguishing- B-lineage - from T-lineage- ALL, 

CD3D and TRB repeatedly showed their usefulness as T-ALL marker genes as 
1 5 already described in Figure 1 1 c (1 8/1 8 correct individual assignments). 

Generally, chromosomal aberrations are strongly associated with morphological 
characteristics. However, there are two chromosomal aberrations which are 
observed in both myeloid and lymphatic neoplasms, i.e. t(11q23)/MLL and the 
t(9;22). The t(9;22) occurs in ALL and CML, and t(11q23)/MLL is observed in ALL 

20 and AML, respectively. Analyzing gene expression signatures of both t(9;22) 
positive ALL and CML we identified two genes, which allowed 17/17 correct 
lineage assignments. CD74 plays a critical role in MHC class II antigen processing 
and demonstrated a lower expression in t(9;22) positive CML (42). This may also 
explain the relationship between the low MHC class II antigen presentation in CML 

25 in general and fits well to the recognized lower HLA-DMB expression in CML as 
compared to CLL (Table 1). CAPN3 is a member of the papain superfamily and 
was higher expressed in CML discriminating them from t(9;22) positive ALL [see 
(Note_ 38894_g_at)]. 

In addition, our results indicate that the expression signatures of two genes, CD24 
30 and CTGF, are sufficient for 14/14 correct assignments of the t(11q23)/MLL 
positive leukemias either to ALL or to AML. Thus, in both scenarios lineage 
assignment can be accomplished based on specific gene expression signatures 
despite the same underlying chromosomal aberrations. 
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Taken together, these data demonstrate the utility of gene expression profiling 
using microarrays for diagnosis of leukemia. In total, 1 1 different leukemia entities 
could clearly be distinguished from each other and from normal BM, respectively. 
These leukemias are associated with highly differing prognoses and require 
5 specific treatment strategies. By performing these analyses on a single platform 
requiring basic molecular biological methods, this approach guarantees broad 
access to high-quality diagnosis of leukemia. The robust gene expression analysis 
with high diagnostic accuracy can substitute the combination of cytomorphology, 
cytogenetics, immunophenotyping, and molecular biological methods used today. 

10 Compared to the combination of methods used so far, this approach also reduces 
costs. In order to introduce diagnostical genomics into routine clinical practice, 
prospective trials in parallel to conventional methods are necessary to prove the 
reliability in-a large cohort- of-patients. -Furthermore, gene expression patterns will 
allow the additional subclassification of leukemia especially in subtypes with no 

15 specific cytogenetic markers and the identification of deregulated master genes 
within distinct leukemia entities can guide the way to new therapeutic approaches. 
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Notes of Example 3 

[see (Notel__GoIub method)] 

When comparing two groups of microarray experiments, Golub's method sorts the 
15 genes with respect to the signal-to-noise ratio of gene x: S x = (//i-//2)/(cri+a 2 ), 
where jj^ and Ok denote the mean expression and standard deviation of gene x in 
group k. According to a specified number of "informative" genes (e.g. 20) the best 
discriminating genes are selected. For each informative gene a decision limit is 
calculated as b x = (//i+// 2 )/2. To classify a new sample of an independent test set, 
20 the gene expression levels of informative genes are taken and for each gene x 
and sample y a so-called vote is calculated as V x = S x (g x y - b x ), where g x y denotes 
expression level of gene x in sample y. The votes of all informative genes are 
summed up ("weighted voting") and depending upon the sign of this sum the new 
sample is classified as group 1 or group 2. The confidence in the prediction is 
25 calculated as |Z V x / I |V X | | . 

However, the decision limit proposed by Golub does not provide optimal 
classification accuracy in all situations. Importantly, when the standard deviation of 
expression levels within the two groups are very different, the decision limit is 
biased towards the group with the higher standard deviation. A decision limit for a 
30 particular gene can be considered optimal, if it achieves maximum classification 
accuracy for a given dataset. By determining systematically classification 
accuracies for a set of possible decision limits, an optimal decision limit can be 
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calculated. We selected an optimal decision limit from the following set of decision 
limits L x : L x = { (g x y + g x y " 1 )/2 | 1 < y <= n } where g x y denotes expression level of 
gene x in sample y, n denotes the total number of samples in the training set. 

Additionally, we applied an heuristic approach to select a minimal set of 
5 discriminative genes, which provides maximum classification accuracy in leave- 
one-out-crossvalidation. We applied for a given set of 20 informative genes 
weighted voting as described above and the classification accuracy was calculated 
by crossvalidation. Therefore, our algorithm consists of the following steps: (i) 
Calculate the top 20 discriminating genes according to the signal-to-noise ratio, (ii) 

10 Calculate classification accuracy and confidence based on optimal decision limits 
for each of the top 20 genes (iii) Select the gene which provides best classification 
accuracy and c onfidence out of step 2. (iv) Test for each of the remaining 19 
genes, whether adding this gene to the model improves accuracy and confidence; 
if the gene improves accuracy and confidence, it is added to the weighted voting 

15 model, otherwise it is disparded. 

In detail, this method can be described as follows: 

Example 3 - Subheading to Note1_Golub method: Abstracts 

Differentially expressed genes can potentially be used in medical diagnostics, if 
20 the gene expression patterns are reliable and specific for a particular disease. 
diffgenes is a program to identify differentially expressed genes in microarray 
experiments. Its algorithm is based on the method proposed by Golub, but 
contains two improvements: an optimized decision limit per gene and a minimal 
set of discriminative genes. 

25 The new method was applied to a human dataset from the domain of cancer 
research consisting of 103 microarrays with 12625 genes each, diffgenes 
outperforms Golub's method clearly both in terms of accuracy and confidence of 
classifications. The biological validation of the results is facilitated, because 
diffgenes identifies a very small number of candidate genes (typically < 5). 

30 Microarray datasets can be analyzed with diffgenes on the Internet at 
http://martin-dugas.de/diffgenes/ 

Example 3 - Subheading to Notel Golub method: Introduction 
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Microarrays are used in ongoing research to characterize disease processes on a 
molecular level. Gene expression analysis enables to identify new subtypes within 
known diseases with prognostic relevance for the patients [Alizadeh 2000]. 

For interpretation of the wealth of data - more than 10.000 parameters per 
5 experiment - it is advisable to integrate microarray data with detailed clinical 
information. For applications in medical diagnostics, significant assocfations 
between gene expression profiles and sample groups resulting in classification 
accuracies in the range of 70 - 80% are not sufficient; for diagnostic purposes at 
least 95% classification accuracy is required. 

10 If a certain disease is characterized by a specific gene product, e.g. a pathologic 
fusion gene, a precise measurement of the expression of this particular gene 
should be a reliable marker for the disease. Therefore in a diagnostic setting, very 
few and specific genes would be desirable. 

However, for many diseases the precise molecular pathogenesis is not yet known. 
15 In addition, the function of many genes on currently available microarrays like 
Affymetrix GeneChip R is still unclear. 

Therefore microarray data should be analyzed and interpreted carefully. By 
integration of data from different diagnostic modalities (morphology, PCR, FISH, 
clinical data) the biological plausibility and consistency of microarray data can be 
20 verified. 



Example 3 - Subheading to Notel Golub method: Methods 
Example 3 - Subheading to Notel Golub method: Golub's method 

When comparing two groups of microarray experiments, Golub's method sorts the 
25 genes with respect to the signal-to-noise ratio of gene x: S x = Gui-ju 2 )/(o-i+o 2 ), 
where /v k and o« denote the mean expression and standard deviation of gene x in 
group k. 

According to a specified number of "informative" genes (e.g. 20) the best 
discriminating genes are selected. For each informative gene a decision limit is 
30 calculated as b x = (^/ 1 +/y 2 )/2. To classify a new sample of an independent test set, 
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the gene expression levels of informative genes are taken and for each gene x 
and sample y a so-called vote is calculated as V x = S x (g x y - b x ), where g x y denotes 
expression level of gene x in sample y. The votes of all informative genes are 
summed up ("weighted voting") and depending upon the sign of this sum the new 
5 sample is classified as group 1 or group 2. The confidence in the prediction is 
calculated as' |Z V x / I |V X | | Jo assess the significance of each gene, a 
permutation test is performed, which determines signal-to-noise ratios when class 
labels are permuted randomly. To assess the robustness of the classifier, a leave- 
one-out crossvalidation is performed. Accuracy is the rate of correctly classified 
10 test samples. Further details are contained in [Golub 1999], [Pomeroy 2002, 
Supplement]. 



Example 3 - Subheading to Notel Golub method: An optimized decision 
limit 

15 The decision limit proposed by Golub does not provide optimal classification 
accuracy in all situations. As can be seen in Figure 13a, when the standard 
deviation of expression levels within the two groups are very different, the decision 
limit is biased towards the group with the higher standard deviation. 

A decision limit for a particular gene can be considered optimal, if it achieves 
20 maximum classification accuracy for a given dataset. By determining 
systematically classification accuracies for a set of possible decision limits, an 
optimal decision limit can be calculated. The diffgenes program selects an optimal 
decision limit from the following set of decision limits L x : 

U = {(g x y +gx H )/2|i <y<=n} 

25 where g x y denotes expression level of gene x in sample y, n denotes the total 
number of samples in the training set. 



Example 3 - Subheading to Note1_Golub method: A minimal set of 
discriminative genes 
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Golubs method selects an arbitrary number of "informative" genes to discriminate 
between two classes of samples according to their signal-to-noise ratio, typically in 
the range of 10 to 50 genes. Choosing too many genes carries the risk of 
overfitting, which causes poor generalization features of the model. Therefore 
5 diffgenes applies an heuristic approach to select a minimal set of discriminative 
genes, which provides maximum classification accuracy in leave-one-out- 
crossvalidation. I.e. for a given set of genes weighted voting as described by 
Golub is applied and the classification accuracy is calculated by crossvalidation. 



The diffgenes algorithm consists of the following steps: 

10 1. Calculate the top 20 discriminating genes according to the signal-to-noise 
ratio 

2. Calculate classification accuracy and confidence based on optimal decision 
limits for each of the top 20 genes 

3. Select the gene which provides best classification accuracy and confidence 
15 out of step 2. 

4. Test for each of the remaining 19 genes, whether adding this gene to the 
model improves accuracy and confidence; if the gene improves accuracy 
and confidence, it is added to the weighted voting model, otherwise it is 
discarded. 



20 



Example 3 - Subheading to Notel Golub method: Results 



The method was applied to a new human dataset from the domain of cancer 
research consisting of 103 Affymetrix Genechip(R) microarrays with 12625 genes 
each. Table 23 presents an analysis of 18 samples class A versus 85 samples 

25 class non-A (Description of Table 23: Analysis of 1 8 samples class A versus 85 
samples class non-A. On the left the analysis according to Golub is presented for 
20 informative genes. The crossvalidation accuracy is 0,87, confidence 0,77. 
Samples, where crossvalidation failed, are listed. For each gene signal to noise 
ratio, p-value (significance obtained from permutation test) and decision limit are 

30 provided. On the right the same data set is analyzed using diffgenes. By selection 
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of 3 genes (marked with asterisks) out of the top 20 genes and selecting optimized 
decision limits, the crossvalidation accuracy reaches 0,96, confidence 0,88.). 
Based on 20 informative genes Golub's method results in a crossvalidation 
accuracy of 0,87 (confidence 0,77); diffgenes achieves with three genes out of the 
5 top 20 set a crossvalidation accuracy of 0,96 (confidence 0,88). The same 
analysis was performed for one versus all (OVA) and all pairs (AP) comparisons in 
this dataset consisting of 5 different classes. Figure 13b presents accuracy and 
confidence obtained by both methods: diffgenes outperforms Golub's method 
clearly both in terms of accuracy and confidence of classifications. The same 
10 - comparative approach was applied to two datasets in cardiology and cell biology 
consisting of 44 and 67 microarrays. The results concerning Golub's method and 
diffgenes were very similar (data not shown). 
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Example 3 - Subheading to Notel Golub method: Discussion 
There are two major challenges in the analysis of microarray data: the number of 
variables (genes) is much higher than the number of individual samples and the 
5 correlation structure of the parameters is widely unknown. 

Golub's method to analyse microarray data has been applied to important medical 
datasets [Armstrong 2002]. Recently many different approaches have been 
applied to microarray data: Classical statistical techniques like ANOVA with 
adjustment for multiple testing, significance analysis of microarrays (SAM) [Tusher 
10 2001] , selection of discriminative genes with support vector machines (SVM), 
neural networks and many more. This indicates that the underlying problem is 
important and non-trivial; a comparison of different methods is needed. 
Robustness of the generated mathematical models is an important issue, therefore 
bootstrap procedures and permutation tests are applied. 

15 For medical diagnostics differentially expressed genes are of interest, but the 
sensitivity and specificity for particular diseases must be validated prospectively in 
larger patient cohorts, diffgenes is an extension of Golub's method to improve 
classification accuracy, which is very relevant in a diagnostic setting. The 
optimized decision limit plays an important role, because the situation presented in 

20 Figure 13a is quite common in biological contexts: group 1 represents samples, 
where the expression of gene x is repressed while gene x is activated in group 2. 
The biological validation of the results is facilitated, because diffgenes identifies a 
very small number of candidate genes (typically < 5). 

Emphasis must be placed on verification of results by other diagnostic procedures, 
25 because the selected "important" genes are not only dependent on the statistics 
procedure, but also on the preprocessing of data. In our setting by integration of 
microarray analysis with other laboratory modalities (morphology, cytogenetics, 
molecular genetics, immunphenotyping) and clinical data the plausibility and 
consistency of results could be evaluated, therefore we are optimistic, that the 
30 demanding requirements for medical diagnostics can be fulfilled with microarray 
technology in the near future. 
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EXAMPLE 3 - [see (Note2_ leave-one-out cross validation)] 

To assess the significance of each gene, a permutation test is performed, which 
determines signal-to-noise ratios when class labels are permuted randomly. To 
5 assess the robustness of the classifier, a leave-one-out crossvalidation is 
performed. Accuracy is the rate of correctly classified test samples. 

EXAMPLE 3 - [see (Note_ 38894_g_at)] 

The second top-ranked gene was represented by the Affymetrix probe set 
10 identifier: 38894__g_a. However, no clear gene assignment was possible for this 
informative prove set. Therefore, CAPN3 was chosen. 
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Example 4: PNAS 

EXAMPLE 4 - ABSTRACT 

Acute myeloid leukemia (AML) is a heterogeneous group of genetically defined 
diseases. Their classification is important with regard to prognosis and treatment. 
5 We performed microarray analyses for gene expression profiling on bone marrow 
samples of 37 patients with newly diagnosed AML. All cases had either of the 
distinct subtypes AML M2 with t(8;21), AML M3 or M3v with t(15;17), or AML M4eo 
with inv(16). Diagnosis was established by cytomorphology, cytogenetics, 
fluorescence-in-situ hybridization, and RT-PCR in every sample. By using two 
10 different strategies for microarray data analyses, this study for the first time 
revealed a unique correlation between AML-specific cytogenetic, aberrations and 
gene expression profiles. 



EXAMPLE 4 - INTRODUCTION 

15 Acute myeloid leukemia (AML) is a heterogeneous group of diseases with respect 
to biology and clinical course. Since the introduction of the FAB-classification in 
1976 diagnosis and classification have been based on cytomorphology and 
cytochemistry(l). As other techniques like immunophenotyping, cytogenetics, and 
molecular genetics contributed to the definition of AML subtypes the FAB- 

20 classification was updated. In 1999 the WHO classification for tumors of 
hematopoietic and lymphoid tissues was> proposed. In an attempt to define 
biologically homogeneous entities which have clinical relevance morphologic, 
immunophenotypic, genetic and clinical features were incorporated(2, 3). 

For optimal treatment approaches both a precise diagnosis and prognostic 
25 parameters that determine response to therapy and survival are needed. So far, 
the karyotype of the AML blasts is the most important independent prognostic 
factor. A favorable outcome under currently used treatment regimens with cure 
rates from 50% up to 85% was observed in several studies in patients with a) 
t(8;21)(q22;q22) occuring mostly in FAB subtype AML M2, b) inv(16)(p13q22) 
30 associated with AML M4eo and c) t(15;17)(q22;q1 1-12) associated with AML M3 
and AML M3v(4-6). In contrast, chromosome aberrations with an unfavorable 
clinical course are -5/del(5q), -7/del(7q), inv(3)/t(3;3) and complex aberrant 
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karyotypes with cure rates of less than 10%(7, 8). The remainder AML patients are 
assigned to a prognostically intermediate group. This latter group is very 
heterogeneous because it includes patients with a normal karyotype as well as 
those with rare chromosome aberrations and yet unknown prognostic impact. 

5 Besides their prognostic impact genetic aberrations are involved in the 
pathogenesis of leukemia. While for unbalanced cytogenetic aberrations the 
heterogeneous pathogenetic mechanisms have not yet conclusively been 
determined, several studies provide strong evidence for the central pathogenetic 
role of leukemia-specific fusion genes that are generated by the above mentioned 
10 balanced abnormalities(9-12). Therefore it can be postulated that AML with 
balanced abnormalities most probably display a homogeneous gene expression 
profile and thus are promising candidates for microarray analyses. 

In a pivotal study, gene expression profiles were analyzed in bone marrow 
samples of 27 ALL and 11 AML. A set of 50 genes out of 6,817 analyzed genes 

15 was sufficient to discriminate ALL and AML. By leave-one-out cross-validation it 
was possible to correctly classify 36 out of 38 acute leukemia cases. A class 
predictor could automatically determine new leukemia cases out of an 
independent test set as belonging to the myeloid or the lymphoid lineage. Thus, 
these results demonstrated the possibility of cancer classification based on gene 

20 expression profiling(13). In a further approach comparing AML with trisomy 8 and 
AML with normal karyotype expression profiling revealed fundamental biological 
differences in AML with isolated trisomy 8 and normal cytogenetics(14). More 
recently, acute lymphoblastic leukemias (ALL) with translocations involving the 
MLL gene could be separated from ALL cases without MLL translocations and 

25 from cases with AML by gene expression profiling(15). 

The aim of our investigation was to answer the question whether a leukemia 
specific genotype is associated with a distinct gene expression profile. Therefore, 
we analyzed three distinct genetic subtypes of acute myeloid leukemia: 
t(8;21)(q22;q22), inv(16)(p13q22) and t(15;17)(q22;q12) which lead to subtype 

30 specific fusion genes AML1-ETO, CBFB-MYH11 and PML-RARA, respectively. 
They are specifically associated with four distinct morphological subtypes 
according to the FAB-classification: AML M2, AML M4eo, AML M3 and AML 
M3v(16-18). We performed microarray analyses on a cohort of leukemia samples 
(n=37) and applied several methodologies to evaluate genes which allowed an 

35 assignment to the corresponding type of cytogenetic aberration for classification. 
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This is the first time that AML-specific cytogenetic aberrations can be correlated 
with corresponding gene expression profiles and vice versa. 



EXAMPLE 4- METHODS 

5 Example 4- Selection and characterization of leukemia samples 

For this investigation we selected bone marrow (BM) samples from 37 AML 
patients representing four morphological and three underlying cytogenetic 
subgroups. All cases were sent for reference diagnostics to our laboratory and 
registered in our leukemia database(19). Samples were received either locally or 
10 by overnight mail. All samples were newly diagnosed de novo AML and were 
characterized by cytomorphology, cytogenetics, FISH, and molecular genetics in 
each case. Gene expression analyses were performed on cells remaining from the 
diagnostic samples. Samples had been lysed immediately, frozen and were stored 
at -80°C from one to 34 months until preparation for gene expression analysis. 

1 5 Example 4- Cytomorphology 

Analysis was based on May-Grunwald-Giemsa stain, myeloperoxidase reaction, 
and non-specific esterase reaction using alpha-naphthyl-acetate. All staining from 
bone marrow and blood was performed routinely according to standard 
procedures(20). The cytomorphologic diagnosis followed the criteria of the FAB 
20 classification and the new WHO classification (1 , 3, 18). 

Example 4- Cytogenetics 

Chromosome analyses were performed on bone marrow or peripheral blood 
samples according to standard protocols(21). Metaphases were analyzed for G- 
bands using a modified GAG-banding technique as described elsewhere(22). 
25 Twenty to 25 metaphase cells were analyzed. The chromosomes were interpreted 
according to the International System for Human Cytogenetic Nomenclature(23). 



Example 4- Fluorescence in situ hybridization (FISH) on interphase nuclei 
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FISH was performed on interphase nuclei on bone marrow smears or on slides 
prepared for cytogenetic analysis. For interphase-FISH at least 100 interphase 
nuclei were evaluated. FISH was carried out using commercially available AML1- 
ETO, PML-RARA and CBFB probes (VYSIS, Downers Grove, II, USA). The 
5 signals were evaluated with an Axioskop R (Zeiss, Jena, Germany). For 
documentation the analyzing system ISIS R (MetaSystems, Altlussheim, Germany) 
was used. 



Example 4- RNA isolation and Reverse-transcription-polymerase-chain- 
10 reaction (RT-PCR) 

Mononuclear cells were isolated by a Ficoll gradient separation. 1x10 7 cells were 
lysed in RLT-buffer (Qiagen, Hilden, Germany) and total RNA was extracted with a 
RNeasy-kit (Qiagen) according to the manufacturers instructions. RNA was eluted 
in 50 //I of elution buffer. 

1 5 Five jj\ of the total RNA, an equivalent quantity of 1 x1 0 6 cells or about 1 fjg of RNA 
were reversely transcribed in a 40 jj\ reaction using 300 U of Superscript" 
(LifeTechnologies, Karlsruhe, Germany) and random hexamers (Pharmacia, 
Freiburg, Germany). 

PCR for the specific AML1-ETO, CBfB-MYHH, or PML-RARA fusion transcripts 
20 were performed as has been described(24). For each sample an ABL specific RT- 
PCR was performed to control the integrity of RNA using primers ABL5': 5'- 
GGCCAGTAGCATCTGACTTTG-3' and ABL3': 5'- 

ATGGTACCAGGAGTGTTTCTCC-3'. Strict precautions were taken to prevent 
contamination. Water instead of cDNA was included as a blank sample in each 
25 experiment. Amplification products were analyzed on 1 .5% agarose gels stained 
with ethidium bromide. 



Example 4- Microarray experiments 

For microarray analysis the GeneChip® System (Affymetrix, Santa Clara, 
30 California) was used. The targets for GeneChip® analysis were prepared 
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according to the current Expression Analysis Technical Manual. Briefly, lysates of 
the leukemia samples were homogenized (QIAshredder, Qiagen, Hilden, 
Germany) and total RNA extracted (RNeasy Mini Kit, Qiagen). Normally, 10 //g 
total RNA isolated from 1x10 7 cells was used as starting material in the 
5 subsequent cDNA-synthesis using oligo[(dT) 2 4T7promotor] 6 5 primer (cDNA 
Synthesis System, Roche Diagnostics, Mannheim, Germany). The cDNA was 
purified by phenol:chlorophorm:IAA extraction (Ambion, Austin, Texas) and 
acetate/ethanol precipitated over night. For detection of the hybridized target 
nucleic acid biotin-labeled ribonucleotides were incorporated during the in vitro 

10 transcription (Enzo® BioArray™ HighYield™ RNA Transcript Labeling Kit, ENZO, 
Farmingdale, USA). After quantification of the purified cRNA (RNeasy Mini Kit, 
Qiagen), 1 5 jjg was fragmented by alkaline treatment (200 mM Tris-acetate, pH 
8.2, 500 mM potassium acetate, 150 mM magnesium acetate) and added to the 
hybridization cocktail sufficient for 5 hybridizations on standard GeneChip® 

15 microarrays.. Before hybridization onto U95Av2, Test3 microarrays (Affymetrix) 
were chosen for monitoring of the integrity of the cRNA. Washing and staining of 
the probe arrays were performed according to the current protocols (Micro_1v1, 
EukGE-WS2v2). The Affymetrix software (Microarray Suite, Version 4.0.1) 
extracted fluorescence intensities from each element on the microarrays as 

20 detected by confocal laser scanning according to the manufacturers 
recommendations. Thirty-two out of 37 hybridization cocktails demonstrated high 
quality cRNA characteristics (Test3 probe arrays: 375' ratio of GAPDH probe sets 
<3.0) and were selected for building up class prediction models. 



25 Example 4- Class separation by principal component analysis 

Potential clusters corresponding to the genetic subgroups were visualized applying 
a two-step approach. The data were scaled from each array to a target intensity 
value 50 (Affymetrix Microarray Suite 4.0.1) in order to be able to perform inter- 
array comparisons. All data were permutated 100 cycles using the multiclass 
30 response parameter of the Significance Analysis of Microarrays algorithm 
(SAM)(25) (http://www-stat.stanford.edu/-tibs/SAM/index.html). The total set of 
12,600 genes was reduced to the significant differentially expressed genes. In a 
second step, the reduced set of genes was prepared for principal component 
analysis (PCA) and analyzed with J-Express(26) (http://www.molmine.com/). For 
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visualization in a two-dimensional plot we chose the first two principal components 
as they captured most of the variation in the original data set. 



Example 4- Class prediction by weighted voting(13) 

5 We adapted a previously described method to reduce the number of candidate 
genes that could distinguish between the three different cytogenetic AML 
subgroups(13). Briefly, to avoid division by zero or negative numbers as occurs 
due to the expression algorithm (Affymetrix Microarray Suite 4.0.1) we set all 
average fluorescence intensities of 1 or less to 1. Then, gene expression levels 

10 were log-transformed. Performing pairwise comparisons (A vs. B), for each gene g 
P(g,c) values and votes (defined by: P(g,c)=(m1(g)-m2(g))/(s1(g)+s2(g))) were 
calculated based on mean expression levels (m) and standard deviations (s) in the 
respective cytogenetic subgroup. Subsequently, votes were summed and 
prediction strength (PS) values reflected the margin of victory in the direction of 

15 either cytogenetic group A or B of the pairwise comparison. PS values range 
between 0 and 1, values >0.45 demonstrate significance (according to the 
permutation test). The relevance of selected genes was assessed by performing 
leave-one-out cross-validation. Only those genes that were contained in all cross 
validation classifiers were considered important. To determine a random 

20 association between genes we performed a permutation test (100 cycles). 
Because the number of informative genes, which are required to discriminate 
between samples, is unknown, we applied this method for different numbers of 
informative genes (range: 2 to 200). The minimal set of genes which provided 
optimal classification accuracy together with the highest prediction strength was 

25 selected to avoid overfitting. To visualize the identified genes and check their 
suitability for class separation a hierarchical cluster analysis was performed 
utilizing J-Express(26) (cluster method: average linkage; distance metric: 
euclidean). The accuracy of this class prediction model was validated on an 
independent test set of five cases of AML not fulfilling the cRNA high quality 

30 criterion as outlined above. 



Example 4- Multiple-tree classifier 
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As basic units in this classifier, classification trees are used(27-29). The optimal 
number of trees has been determined to be 15 (data not shown). Class votes of 
these trees are aggregated by a vote-by-majority rule. The classifier was fed with 
gene expression intensity values from a set of 973 genes that had been chosen 
5 based on their r statistic: 

-^1 

|>. 

where refers to the class averages, Jt to the overall average, a i to the within- 
class standard deviation, and summation is carried out over all k classes. The 
threshold was set to r > 0.75 . Classification trees were constructed as follows: tree 

10 building was performed while restricting trees to contain no more than n-1 nodes 
to discriminate between n classes. The C5.0 algorithm was used(28). The 
variables (gene expression intensities) used for tree construction were eliminated 
from the data set, and a new tree was calculated based on the truncated data set. 
This procedure was iterated until the predetermined number of trees had been 

15 reached. The accuracy of the multiple-tree classifier was estimated by 10-fold 
cross validation(30) and on an independent test set of data from 5 bone marrow 
aspirates, where the quality of the corresponding cRNA preparation was slightly 
lower than the high quality standards required for the training set. 



20 

EXAMPLE 4 - RESULTS 

Example 4- Characterization of leukemia samples 

We investigated 37 AML cases representing three defined cytogenetic aberrations 
corresponding to four FAB subtypes: t(8;21 )(q22;q22)/AML M2 (n=9), 

25 t(15;17)(q22;q12)/AML M3 or AML M3v (/7=10, n=8), and inv(16)(p13q22)/AML 
M4eo (n=10). All cases were characterized by cytomorphology, cytogenetics, 
FISH, and RT-PCR (Fig. 14). All cases with AML and t(8;21) had AML M2, all with 
AML and inv(16) had AML M4eo, ten cases with AML and t(15;17) had AML M3, 
and eight cases with AML and t(15;17) had AML M3v. All patients showed these 

30 balanced abnormalities as the sole karyotype change. Using FISH analysis, more 
than 65% of cells demonstrated the specific signal constellation. The respective 
fusion transcripts were detected by RT-PCR in all samples. The median age of all 
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patients was 53 years (range, 19-82 years; male:female=15:22) and did not differ 
between the respective groups. AML subtypes M3 and M3v both carry the same 
chromosomal aberration but differ in morphological aspects like nuclear 
configuration, granulation, and clinical aspects like white blood cell count (WBC). 
5 The median WBC count was 20,000///l (range, 800-1 68,000///l) and was strikingly 
lower in patients with AML M3 as compared to all other patients (median, 6,200 vs. 
36,500///!, P=0.0002). 



Example 4- Microarray analyses 

10 The gene expression profiles of 37 AML samples were evaluated. Thirty-two 
hybridization cocktails demonstrated high quality cRNA characteristics (Test3 
probe arrays: 375' ratio of GAPDH probe sets <3.0) and were selected for building 
class prediction models: t(8;21)/AML M2 (n=7), t(15;17)/AML M3 or M3v (n=9, 
n=7), and inv(16)/AML M4eo (n=9). Five cases were primarily excluded (375' 

15 ratios ranging between 3.9 and 5.4, see Methods) and were used for subsequent 
validations of the class prediction models: t(8;21)/AML M2 (n=2), t(15;17)/AML M3 
or M3v (n=1, n=1), and inv(16)/AML M4eo (n=1). 



Example 4- Class separation by principal component analysis 

20 In order to visualize clusters corresponding to the three underlying genetic 
subgroups we applied a two-step approach. Based on a permutation test (100 
permutations) we correlated our expression data to the three different cytogenetic 
parameters(25). We obtained 1000 significant genes. By principal component 
analysis we were able to clearly separate the three distinct chromosomal 

25 aberrations t(8;21), t(15;17), and inv(16) (Fig. 15)(26). These data suggest that 
genetically defined AML subtypes can be specified and identified based on their 
gene expression profiles. 



Example 4- Class prediction by weighted voting(13) 
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In order to identify the genes which enable the accurate discrimination of these 
subgroups, we applied the data analysis methodology introduced by Golub et 
al.(13). We selected the minimal set of genes which provided optimal classification 
accuracy together with' the highest prediction strength to avoid overfitting. Thirteen 
5 genes were sufficient to separate these AML subtypes with high precision (Table 
24; Table 24 shows that a minimal set of 13 genes (GenBank accession numbers 
are given) is sufficient for accurate class prediction with optimal classification 
accuracy and highest prediction strength. Comparisons (A vs. B) were performed 
either between two distinct subtypes or between one distinct subtype and all other 

10 subtypes (remainder), respectively. As calculated from pairwise comparisons, 
positive P(g,c) values indicate a higher expression in first class listed, negative 
P(g,c) values a higher expression in second class listed, respectively). GenBank 
accession numbers and detailed descriptions of the genes are given in table 25 
(Table 25: Thirty-six genes separate accurately three distinct cytogenetic AML 

15 subtypes. GenBank accession numbers, approved human gene nomenclature 
symbol (*=not approved)' and description of the function are presented. Six genes 
are included in the minimal set of both weighted voting according to Golub et 
al.(13) (total=13) and multiple-tree classifiers (total=29). 

All 32 clinical samples could be assigned to their corresponding cytogenetic 
20 subtype with best accuracy in leave-one-out cross-validation (1.0). Prediction 
strength values ranged from 0.91 to 0.98 (Table 24). To illustrate these results we 
applied hierarchical clustering(31). The resulting dendrogram clearly demonstrates 
the capacity of this subset of genes to separate all AML cases according to their 
cytogenetic aberration (Fig. 16). This demonstrates that class prediction of a 
25 chromosomal aberration in AML is feasable solely based on gene expression data. 

For external validation, we tested whether primarily excluded samples could also 
be accurately assigned to their specific cytogenetic category. Despite their non- 
optimal cRNA quality, all 5 cases were correctly classified with high prediction 
strength (0.76,1 .00,1 .00,1 .00,1 .00). 

30 

Example 4- Class prediction by multiple-tree models 
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As a second and independent methodological approach we developed a multiple- 
tree classifier to separate the three genetically defined subtypes based on the 
expression level of a minimal set of genes. In short, we computed classification 
trees to discriminate-between the different AML subclasses. To avoid overfitting of 
5 a singular tree model, we computed a multiple-tree model using an iteratively 
reduced set of genes. For each tree, we used only those genes that have not been 
used by the previously computed classification tree. The procedure is stopped 
when a predetermined number of trees has been reached. For this study, the 
optimal number of trees was calculated to be 15. The votes of the 15 trees were 
10 aggregated by a vote-by-majority rule. Equal votes for two of the three classes 
were counted as misclassification. 

The classifier utilized the expression values of 29 genes (MYH11 was identified 
twice by two different probe sets; Table 25) to discriminate between three classes, 
namely samples displaying t(15;17), t(8;21), and inv(16) (Fig. 17). The accuracy 
15 on the training set (r?=32) was 100%, and on the independent test set (n=5) 100%. 
The average accuracy in ten-fold cross validation was 94%. 

In summary, we identified 36 genes using two independent methodologies for 
class prediction in AML (Table 25). Six genes were described in both calculations, 
seven were found exclusively in the minimal set according to Golub et al.(13), and 
20 another 23 genes using multiple-tree classifiers. 

Example 4- Correlation of phenotype and gene expression profile 

We were able to demonstrate striking correlations between genotype and gene 
expression profiles in three genetically defined subgroups of AML. In addition, we 

25 answered the question, whether the cytogenetically identical AML with t(15;17) but 
appearing with two different phenotypes, AML M3 or AML M3v (Fig. 14), can also 
be separated by different gene expression patterns. We used 100-fold permutation 
of M3 (n=10) and M3v (n=8) data followed by principal component analysis and 
hierarchical cluster analysis based on 82 informative genes (data not shown). 

30 Separation into the corresponding two morphologically defined FAB subtypes M3 
and M3v was possible in all cases (Fig. 18) and suggests also a close correlation 
between phenotype and gene expression profile. 
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EXAMPLE 4 - DISCUSSION 

This is the first study to demonstrate an unequivocal association between disease- 
specific genetic alterations and distinct gene expression profiles. For each of the 
5 three analyzed clearly defined subtypes of AML (t(8;21), t(15;17), inv(16)) patterns 
of gene expression were identified that were homogeneous within all samples of 
the respective subgroups but clearly differed between these three subgroups. The 
analyzed samples represent disease subtypes that are specifically defined on the 
genetic and the phenotypic level by conventional diagnostics including 
10 cytomorphology, cytogenetics, and molecular genetics. 

By applying two independent approaches for the analysis of microarray data, the 
present study demonstrates that AML samples from previously defined 
subtypes(3) carrbe classified'adequately on the basis of gene expression profiles. 
It is intriguing that there is both sufficient coherence in gene expression within and 
15 difference between these subtypes to classify them with high accuracy even 
though the samples derive from the same myeloid cell lineage. 

In order to correlate gene expression with cytogenetics Virtaneva et al. compared 
the expression status of 6,606 genes of AML blasts with normal cytogenetics and 
trisomy 8 as the sole abnormality. While in this study normal CD34+ cells clustered 

20 into a distinct group, AML with trisomy 8 and AML with normal karyotype 
intercalated with each other. Microarray analyses showed an overall increased 
gene expression of genes located on chromosome 8 suggesting a gene-dosage 
effect(14). AML with trisomy 8 is heterogeneous on the phenotypic level as it 
occurs in different FAB subtypes. In contrast, AML with t(15;17), inv(16) and 

25 t(8;21) show a very close correlation to distinct morphological subtypes. 
Furthermore, trisomy 8 is probably not a primary, disease-defining aberration 
leading to AML as it also occurs in addition to a variety of different cytogenetic and 
molecular genetic abnormalities(32, 33). In contrast to this study, Armstrong et al. 
compared samples of the more homogeneous group of ALL with MLL 

30 translocations to ALL without MLL translocations and to AML(15). They 
demonstrated that ALL with MLL translocations comprizes a distinct disease which 
can be classified robustly by gene expression profiling. 

The main focus of the present analyses was the assessment of the differences 
35 between three highly characterized subgroups of AML defined by specific primary 
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chromosome aberrations. As anticipated, it was shown that AML with t(8;21) and 
AML with inv(16), which both involve alterations of the core binding factor- 
complex, are more related to each other as compared to AML with t(15;17)(34). 
Both phenotypically "different subtypes of AML with t(1 5; 17), AML M3 and AML 
5 M3v, cluster within one area. In an additional analysis, also the latter two subtypes 
were separated from each other based on their gene expression profiles. This data 
suggests the existence of further genetic and not yet identified alterations leading 
to the different phenotypes of AML M3 and AML M3v. One possible candidate 
gene is FLT3 which is mutated more frequently in AML M3v than in AML M3 (67% 
10 vs. 19%, P=0.001)(35). 



Several studies confirmed that gene expression profiles can be used for class 
prediction. This has been shown for acute leukemias, round blue cell tumors, and 
malignant melanomas(13, 36-38) as well as for different types of solid tumors 
using multi-class cancer classification(39). While the selection of different 
subgroups in these studies was performed using exclusively phenotypic criteria, 
other studies were based on genetically defined entities(40, 41). In the present 
study not only the discrimination of the three genetically defined AML subgroups 
was accomplished but also all these cases of AML were separated from normal 
bone marrow (data not shown)(42). 

To develop a classifier two independent approaches were applied. While 
classification by weighted voting according to Golub et al.(13) allows the 
discrimination between the three classes based on a minimal set of 13 genes, the 
multiple-tree classifier utilizes 30 genes. As indicated by cross-validation, 
25 generalization properties are excellent for the multiple-tree classifier, i.e. it is likely 
to perform equally well on new, unseen samples. Furthermore, it can be easily 
extended to more than the three subclasses described in the present study. 

Our classifiers contained genes already known to be primarily involved in the 
pathogenesis of the respective entities, namely MYH11(43) and ETO(44). 
30 Presumably, the detection of overexpression of MYH11 in inv(16) cases and of 
ETO in t(8;21) cases relates to the detection of the fusion gene transcripts rather 
than of the wild type transcripts. The other genes identified belong to various 
functional categories. Their potential pathogenetic significance in AML has to be 
clarified yet. 
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It is expected that the extension of the present analyses to currently less well- 
defined AML will identify additional subgroups of AML with clinical relevance 
based on their gene expression profiles. The feasibility of such an approach has 
been demonstrated for the "first time "for diffuse large B-celMymphoma(45). 
5 Alizadeh et al. have subdivided an entity previously considered homogeneous by 
various pathological methods into two not only new but also prognostically highly 
relevant subgroups. In two recent studies, gene expression profiling also in breast 
cancer revealed subgroups significantly differing in their prognosis(46, 47). With 
regard to AML, this approach may be most promising in AML with normal 
10 karyotype. This subgroup cannot be further defined on the cytogenetic level and is 
characterized by an intermediate prognosis possibly masking poor and favorable 
subgroups. 

In ad dition, the current data may have major implications with regard to delineating 
aberrant gene expression pathways underiyTng the palhogenesis of AML. As has 
1 5 been shown in mantle cell lymphoma and medullobIastoma(48, 49) the extension 
of our analyses to all subgroups of AML should enable us to define the 
deregulated genes important for the initiation and the progression of AML. Finally, 
these analyses will promote the identification of new targets for specific treatment 
approaches. 

20 
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Example 6: Correlation of Protein Expression and Gene Expression in Acute 

Myeloid Leukemia 

INTRODUCTION 

The determination of the surface and cytoplasmic expression of characteristic 
5 proteins by flow cytometry (FC) is a common method applied to the diagosis and 
the subclassification of acute myeloid leukemias (AML) 1 . The oligonucleotide 
microarray analysis (MA) represents a novel technology for the simultaneous 
detection of the mRNA abundance of large numbers of genes 2,3 . Based on specific 
gene-expression patterns distinct disease entities have been identified 4 * 6 . 

10 Therefore MA may become of major importance as a diagnostic tool for AML in the 
near future 7,8 . However, up to now data on the correlation between protein 
expression levels and mRNA abundance are limited 9 " 12 . To analyze the relation of 
^rofein"expression and mRNA abundance inAMtrwe- performed 450 individual 
comparisons of 29 genes in 25 patients with AML at diagnosis analyzed by FC and 

15 MA in parallel 13 . 

METHODS 
Samples 

Bone marrow samples from highly characterized patients with newly diagnosed 
20 and untreated AML were used. Samples had been analyzed by cytomorphology, 
cytochemistry, cytogenetics and molecular genetics in all cases and were 
characterized by either of the balanced chromosomal aberrations t(8;21), t(15;17), 
or inv(16) and the respective molecular and morphologic features 7 . The studies 
abide by the rules of the local Internal Review Board and the tenets of the revised 
25 Helsinki protocol. 
Flow cytometry 

The studies were performed on cells isolated from bone marrow by Ficoll-Hypaque 
density gradient centrifugation as described previously 14 . Applying triple-stainings 
and isotype controls monoclonal antibodies against 29 antigens were used in the 
30 following combinations as designed for diagnostic purposes (conjugated with the 
fluorochromes FITC, PE, and PC-5, respectively): CD34/CD2/CD33, 
CD7/CD33/CD34, CD34/CD56/CD33, CD11b/CD33/CD34, CD647CD4/CD45, 
CD157CD13/CD33, HLA-DR/CD33/CD34, CD34/CD135/CD33, 
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CD34/CD116/CD33, CD34/NG2/CD33, CD38/CD133*7CD34, CD61/CD14/CD45, 
CD36/CD235a/CD45, CD34/CD1 0/CD1 9, MPO**7LF**7cyCD1 5, . 
TdT/cyCD22/cyCD3, TdT/cyCD79a/cyCD3. All antibodies were purchased from 
Immunotech (Marseilles, France), except for: * = Medarex (Annandale, NJ); ** = 
5 Milteny Biotech (Bergisch Gladbach, Germany); *** = Caltag (Burlingame, CA). 
The respective combinations of antibodies were added to 1x10 6 cells (volume, 100 
jj\) and incubated for ten minutes at room temperature. The samples were then 
washed twice in phosphate-buffered saline (PBS) and resuspended in 0.5 ml PBS. 
FC analysis was performed using a FACSCalibur flow cytometer (Becton 

10 Dickinson, San Jose, CA). Analysis of list-mode files was performed by means of 
the CellQuest Pro Software (Becton Dickinson). Antigen expression was rated 
positive at a cut-off level of 20% of the cells within the mononuclear gate for 
membrane proteins and at a cut-off level of 10% for cytoplasmic antigens. Mean 
fluorescence intensity values were calculated for ail events with fluorescence 

1 5 values higher than isotype controls. 

Microarray experiments 

For microarray analysis the GeneChip® System (Affymetrix, Santa Clara, 
California) was used. The targets for GeneChip® analysis were prepared 

20 according to the current Expression Analysis Technical Manual. Briefly, lysates of 
the leukemia samples were homogenized (QIAshredder, Qiagen, Hilden, 
Germany) and total RNA extracted (RNeasy Mini Kit, Qiagen). Normally, 10 M9 
total RNA isolated from 1x10 7 cells were used as starting material in the 
subsequent cDNA-synthesis using oligo[(dT) 2 4T7promotor] 6 5 primer (cDNA 

25 Synthesis System, Roche Diagnostics, Mannheim, Germany). The cDNA was 

purified by phenol:chlorophorm:isoamylalcohol extraction (Ambion, Austin, Texas) 
and acetate/ethanol precipitated overnight. For detection of the hybridized target 
nucleic acid biotin-labeled ribonucleotides were incorporated during the in vitro 
transcription (Enzo® BioArray™ HighYield™ RNA Transcript Labeling Kit, ENZO, 

30 Farmingdale, USA). After quantification of the purified cRNA (RNeasy Mini Kit, 
Qiagen), 15 jjg were fragmented by alkaline treatment (200 mM Tris-acetate, pH 
8.2, 500 mM potassium acetate, 150 mM magnesium acetate) and added to the 
hybridization cocktail sufficient for 5 hybridizations on standard GeneChip® 
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microarrays. Before hybridization onto U95Av2, Test3 microarrays (Affymetrix) 
were chosen for monitoring of labelling efficiency and the integrity of the cRNA. 
Washing and staining of the probe arrays was performed according to the current 
protocols (Micro_1v1 , EukGE-WS2v4). The Affymetrix software (Microarray Suite, 
5 Version 4.0.1 ) extracted fluorescence intensities from each element on the 
microarrays as detected by confocal laser scanning according to the 
manufacturers recommendations. In order to be able to compare different 
experiments the global 

microarray intensities were scaled to a common target intensity. Furthermore, the 
10 mRNA abundance of the genes was qualitatively rated as a) present, b) marginal, 
and c) absent calls, respectively. 

Statistics - 

A total of 29 genes were analyzed in 25 patients with AML. The congruence of 
15 positivity and negativity of the expression of the respective genes as determined 
by FC and MA was analyzed for each gene in each individual patient. 
Comparisons of microarray intensities were performed by Mann-Whitney U-Xest. 
Analyses for bivariate correlations of mRNA and protein expression levels were 
performed by Pearson's correlation using SPSS, Version 10.0.7. 

20 

RESULTS AND DISCUSSION 

Twenty-five cases of AML were analyzed in parallel by FC and MA for the 
expression of 29 genes. Seven had AML M2 with t(8;21), 5 had AML M3 with 
t(15;17), 7 had AML M3v with t(15;17), and 6 had AML M4Eo with inv(16). A total 

25 of 450 comparisons of individual expression data obtained by both methods were 
performed. Of these, 399 (88.7%) revealed congruent results for protein 
expression and mRNA abundance (230 cases (51.1%) with positive expression 
and 1 69 cases (37.6%) with negative expression, respectively; table 26). In 30 
comparisons (6.7%) MA detected positivity for mRNA expression (call: present) 

30 while the results of FC indicated negativity. In 21 cases (4.7%) protein expression 
was demonstrated by FC while no mRNA expression was detected by MA (call: 
absent). 
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Focussing on the genes most specific for the diagnosis of AML, i.e. 
myeloperoxidase, CD13, and CD33, a high correlation between protein expression 
and mRNA abundance was observed (congruence in 73 of 75 comparisons 
(97%)). In detail, all cases were rated positive for expression of myeloperoxidase 
5 and all but one were positive for both CD13 and CD33, respectively, by both 
methods. Furthermore, for most other genes essential for the subclassification of 
AML as well as for the distinction of AML from acute lymphoblastic leukemia and 
chronic leukemias the results obtained by both methods were always congruent 
(i.e., for CD10, CD22, CD7, CD133, CD116, CD11b, CD61, CD45, HLA-DR, NG2) 
10 or were congruent in the majority (1 17/140, 84%) of cases (CD79a, CD19, CD2, 
CD3, CD15, Lactoferrin, CD14, CD235a, CD135, CD34; Table 26). 
Furthermore, the high correlations between protein expression and mRNA 
'abundance were not limited to congruence in positivity but were significantly 
correlated also quantitatively. To proof this, the protein expression levels and 
15 mRNA abundance were compared by Pearson's correlation in genes expressed in 
the majority of the analyzed cases. These comparisons revealed significant 
correlations for the fluorescence intensities as assessed by FC and MA for CD13 
(p=0.001), CD33 (p=0.034), CD34 (p=0.003), CD45 (p=0.015), CD15 (p=0.016), 
and CD7 (p=0.033) and thus further underline the high coherence of expression 
20 patterns for both protein and mRNA (figure 19). 

Thirty comparisons displayed mRNA expression and no protein expression. Due to 
the ongoing process of maturation (CD14, CD15) and due to the cross-lineage 
expression of the genes (CD3, CD19) the levels of mRNA abundance may have 
been to low to result in detectable protein expression levels using the described 
25 cut-off levels of 20% and 10%, respectively. This suggestion is supported by a 
quantitative analysis of mRNA expression data which shows relatively low albeit 
positive levels for the respective cases and genes (mean average fluorescence 
intensity, 46.7±54.5 in cases positive for CD14, CD15, CD3, or CD1 9 versus 
389.4±831 .0 in all positive cases, Mann-Whitney LMest: p<0.001) while at the 
30 same time protein expression amounts to a mean of 5±4%. 

Twenty-one comparisons displayed positivity by FC and negativity of MA, which 
comprize 4.7% of all individual comparisons performed. These discrepancies most 
probably are due to: a) erythrocytic debris positive for CD36 interfering with the 
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acquisition of CD36 negative cells during flow cytometric analysis; b) differences 
between both methods in the selected DNA sequences and antigen epitopes, 
respectively, detected (i.e. CD38, CD4, CD56); and c) differences in the stability of 
mRNA and protein of the respective genes. 
5 Overall, these results demonstrate for the first time that there is a significant 
correlation between protein expression and gene expression in AML and thai the 
antigens so far identified essential for the diagnosis and subclassification of AML 
by flow cytometry may represent additional candidate genes when using MA as a 
diagnostic tool for molecular cancer class prediction 15,16 . Furthermore, it is 

10 anticipated that the present analyses represent a prime example and will be 

reproduced for a variety of other entities like lymphoid malignancies. Due to their 
high potential to assess the expression patterns of high numbers of genes and due 
to their excellent reproducibility features microarrays are a promising future 
diagnostic tool. As a consequence, they may replace the more time and resource 

15 consuming diagnostic methods currently used for diagnosing leukemias like . 
cytomorphology, cytogenetics, and FC. 
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Example 6: Gene Expression Profiles of Distinct Cytogenetic AML 
Subtypes as Defined by the New WHO Classification: A Study of 45 
Patients 

Example 6: Introduction 

5 Since their introduction, microarrays have been promising tools for basic research. 
With regard to leukemia, the pivotal discrimination of unselected acute 
lymphoblastic (ALL), and acute myeloid leukemia (AML) samples based on their 
gene expression signatures inspired numerous studies (Golub et al., 1999). We 
performed gene expression analyses to designate candidate genes for 

1 0 discriminating specific AML samples from normal bone marrow (BM) of healthy 
volunteers. With regard to the classification of hematological malignancies 
according to the WHO, distinct AML subtypes have been established based on 
genetic abnormalities of the leukemic blasts. Here, we demonstrate gene 
expression analyses of 8 healthy BM donors and 45 leukemia patients 

1 5 representing four cytogenetic subtypes of AML: t(8;21 )(q22;q22), inv(1 6)(p1 3q22), 
t(15;17)(q22;q12), and t(1 1q23)/MLL. Combining different approaches for data 
analysis a minimal set of genes was identified to designate a reliable class 
prediction model. Based on the expression pattern of 39 genes, cytogenetically 
defined AML subtypes could accurately be predicted and separated from healthy 

20 BM. Taken together, gene expression signatures of AML cases with recurrent 

genetic abnormalities demonstrate a very close correlation between genotype and 
gene expression. Therefore, introducing a set of candidate genes, expression 
profiling may serve for diagnosis of AML subtypes defined by the new WHO 
classification. 

25 

Example 6 Material and Methods 

We analyzed BM aspirates from 8 healthy volunteers and the following 45 
untreated AML patients: 

• t(8;21 )(q22;q22)/AML M2 (n=9), 

30 • t(15;17)(q22;q12)/AMLM3/M3v(n=16), 
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• inv(16)(p13q22)/AML M4eo (n=10), and 

• t(1 1 q23)/MLL-aberrations (n=1 0) 



Example 6- Microarray experiments. Gene expression analyses were performed 
5 from cells remaining from the diagnostic sample. They had immediately been 
lysed, frozen and were stored at -80°C from 1 to 34 months until preparation for 
gene expression profiling. The targets for U95Av2 microarrays were prepared 
according to current protocols (Affymetrix). Before expression profiling, Test3 
Probe Arrays were chosen for monitoring the integrity of the cRNA. 

10 Example 6^- Results I: Characterization of leukemia samples 

AML samples were thoroughly characterized by a combination of cytomorphology, 
cytogenetics, FISH, RT-PCR and quantitative real-time PCR (Fig. 20). All patients 
showed the above mentioned balanced abnormalities as the sole karyotype 
change. Using FISH analysis, more than 90% of cells demonstrated the specific 

15 signal constellation. The respective fusion transcripts AML1-ETO in t(8;21), CBFD- 
MYH11 in inv(16), PML-RARD in t(15;17) and various MLL-fusion partners in 
t(11q23) were detected by PCR techniques in all samples. These subtypes are 
specifically associated with five cytomorphological subtypes according to FAB 
classification: inv(1 6)(p1 3q22)/AML M4eo, t(8;21 )(q22;q22)/AML M2, 

20 t(15;17)(q22;q12)/AML M3/M3v, and t(11q23)/MLL in FAB M5a/b, respectively. 
AML subtypes M3 and M3v both carry the same chromosome aberration but differ 
in morphological and clinical aspects. 

Example 6 - Results II: Class separation 

For data analysis we combined different approaches. First, a reduced subset of 
25 200 genes obtained by permutation-based neighborhood analysis (SAM, Tusher et 
al., 2001) was visualized for corresponding clusters using principal component 
analysis (J-Express, Dysvik et ah, 2001)(Fig.21). Samples from healthy donors 
cluster into a distinct group, likewise all AML samples demonstrate homogenity by 
forming a second cluster. 

30 Example 6 - Results 111: Class prediction 
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Next, we adapted the signal-to-noise/weighted voting algorithm (Golub et al., 
1999) to identify discriminative genes. A minimal set of 39 genes, which provided 
both optimal classification accuracy and highest prediction strength, was selected 
to avoid overfitting. The significance of each gene was tested by permutation- 
5 based neighborhood analysis. The robustness of the classifier was assessed by 
leave-one-out crossvalidation. These expression signatures were sufficient to 
distinguish AML samples with high accuracies from normal bone marrow and to 
predict the recurrent chromosome aberration, respectively (Table 27, Fig. 22). 
Table 28a shows for which comparison a gene was important including its 
10 statistical significance. 

A set of 39 genes is sufficient for class prediction. Accuracy denotes the rate of 
correctly classified test samples. P(g,c) indicates the signal-to-noise ratio of gene 
X-$x = (Mi-J^t&±^.vfaBTe /Jk and 6 k denote the mean expression and standard 
deviation of gene x in group k. As calculated from pairwise comparisons (class A 
15 vs. B), positive P(g,c) values indicate a higher gene expression in class A, 
negative P(g,c) values a higher gene expression in class B, respectively. HGNC 
symbols are given in column 1 . 

All leukemia samples could accurately be assigned to their corresponding 
cytogenetic subtype with 100% accuracies. To illustrate these results, a 
20 hierarchical clustering is shown (Fig. 23). 

Example 6 - Conclusions 

• The expression pattern of 39 genes allowed precise class assignments of four 
cytogenetically defined AML subtypes according to the WHO classification of 
hematological malignancies, and normal BM, respectively. 

25 • Thus, we introduce candidate genes suitable for diagnosis of AML subgroups 
based on gene expression profiling. 

• Potentially, gene expression patterns will allow the additional subclassification 
of AML, especially in subtypes with no specific cytogenetic markers (e.g. 
normal karyotype). 

30 

Example 7: Gene Expression Profiles of Distinct Leukemia Types and Subtypes: A 
Study of 280 Patients using high-density microarrays 
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Example 7: Introduction 

Here, we demonstrate gene expression analyses of 9 healthy BM donors and 271 
leukemia patients representing: 

AML: 4 distinct cytogenetic subtypes t(8;21)(q22;q22) (AML t(8;21)), 
5 inv(16)(p13q22) (AML inv(16)), t(15;17)(q22;q12) (AML t(15;17)), and 
t(1 1q23)/MLL (AML MLL). In addition, we analyzed AML samples characterized by 
normal karyotypes (AML normal), complex aberrant karyotypes (AML complex), 
trisomy 8 as sole aberration (AML +8), and other chromosomal changes (AML 
other). 

10 ALL: 3 distinct genetically defined subtypes: t(4;11)(q21;q23) (ALL t(4;11)), 
t(8;14)(q24;q32) (ALL t(8;14)), t(9;22)(q34;q1 1) (ALL Ph) and 2 subtypes defined 
by their immunophenotype: ALL of the B-lineage not carrying the t(9;22) (ALL B 
not Ph) and T-ALL (T-ALL) 

CLL: 5 genetically defined subtypes: trisomy 12 (tri 12), deletion 11q (11q-), 
15 deletion 13q (13q-), deletion 17p (17p-) and none of these aberrations (normal) 

CML (CML) without any further subdivison and 

Normal bone marrow from healthy volunteers (normal BM). 

20 We used the Affymetrix oligonucleotide microarray technology (GeneChip® 
Instrument System) to obtain gene expression profiles of each individual clinical 
sample of interest. The commercially available HG-U133 probe arrays gave 
information about the relative mRNA abundance of about 33,000 human genes 
which are represented on these high-density DNA-oligonucIeotide microarrays. 

25 Chip Information (as provided by manufacturer): 
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The GeneChip® Human Genome U133 Set (HG-U133A and HG-U133B) is 
comprised of two microarrays containing over 1,000,000 unique oligonucleotide 
features covering more than 39,000 transcript variants, which in turn represent 
greater than 33,000 of the best characterized human genes. This powerful set 
5 allows to reproducibly examine the quantitative and qualitative expression of most 
genes in the human genome, and was designed using the recently published and 
publicly available draft of the human genome sequence. Sequences used in the 
design of the array were selected from GenBank, dbEST, and RefSeq. Sequence 
clusters were created from Build 133 of UniGene (April 20, 2001) and refined by 

10 analysis and comparison with a number of other publicly available databases 
including the Washington University EST trace repository and the University of 
California, Santa Cruz golden-path human genome database (April 2001 release). 
In addition, ESTs were analyzed for untrimmed low-quality sequence information, 
correct orientation, false priming, false clustering, alternative splicing and 

15 alternative polyadenylation. 

Combining different approaches for data analysis, a set of genes was identified to 
designate a reliable class prediction model. Based on the expression pattern of 
those genes, defined leukemia types and subtypes could accurately be predicted 
and separated from healthy BM. Taken together, gene expression signatures 
20 demonstrate a very close correlation between genotype and gene expression. 
Therefore, introducing a set of candidate genes, measurements of mRNA 
abundancies by gene expression profiling serves for diagnosis of leukemia types 
and subtypes. 

Example 7 Material and Methods 

25 We analyzed BM aspirates from 9 healthy volunteers and the following 280 
leukemia patients: 

Acute myeloid leukemia (AML) 

t(8;21)(q22;q22)/AML M2 (n=13), 
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t(15;17)(q22;q12)/AML M3/M3v (n=20), 
inv(16)(p13q22)/AML M4eo (n=12), 
t(11q23)/MLL-aberrations (n=15) 
trisomy 8 (n=10) 
5 normal karyotype (n=62) 

complex aberrant karyotype (n=36) 
other aberrations (n=5) 
Acute lymphoblastic leukemia (ALL) 
t(4;11)(q21;q23)(n=9) 
10 t(8;14)(q24;q32) (n=4) 

t(9;22)(q34;q1 1 ) (ALL Ph) (n=1 5) 

ALL B lineage without t(9;22) (ALL B riot Ph) (n=9) 

T-ALL (n=9) 

Chronic lymphocytic leukemia (CLL) 
15 trisomy 12 (tri 12) (n=5) 
deletion 11q (11 q-) (n=4) 
deletion 13q (13q-) (n=10) 
deletion 17p (17p-) (n=4) 
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none of these aberrations (normal) (n=9) 

Chronic myeloid leukemia (n=14) 

Normal bone marrow (normal BM) (n=9) 

Example 7 - Results I: Characterization of leukemia samples 

5 We selected bone marrow (BM) samples from 271 leukemia patients at diagnosis 
representing 18 different disease entities or subentities and from 9 healthy 
volunteers, respectively. All cases were sent for reference diagnostics to our 
laboratory, registered in our leukemia database and were treated within 
prospective randomized multi-center trials. The studies abide by the rules of the 

10 local internal review board and the tenets of the revised Helsinki protocol. Samples 
were received either locally or by overnight mail. Diagnosis was performed by an 
individual combination of cytomorphology, cytogenetics, FISH, 
immunophenotyping and molecular genetics. Mononuclear cells were isolated by a 
Ficoll gradient, lysed, frozen and were stored at -80°C from one to 34 months until 

15 sample preparation for gene expression analysis. All leukemia samples were 
thoroughly characterized by a individual combination of cytomorphology, 
cytogenetics, immunophenotyping, fluorescence in situ hybridisation (FISH), 
polymerase chain reaction based methods both qualitative RT-PCR and 
quantitative real-time PCR. Using FISH analysis, more than 90% of cells 

20 demonstrated the specific signal constellation. The respective fusion transcripts 
BCR-ABL in t(9;22) positive CML (Schoch et al. 2002a) and in t(9;22) positive ALL, 
AML1-ETO in AML with t(8;21), CBFbeta-MYH1 1 in AML with inv(16), PML- 
RARalpha in AML with t(15;17) (Schoch et al. 2002b) and various MLL-fusion 
partners in both AML and ALL with t(11q23) were detected by FISH and PCR 

25 techniques in all samples. 

In t(8;14) positive ALL the IGH-C-MYC rearrangement was confirmed by FISH. In 
all cases with AML and complex aberrant karyotype 24 color FISH was performed 
in addition to chromosome banding analysis (Schoch et al. 2002c). 
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Genetic subtyping of CLL was carried out using interphase FISH with the following 
probes (Buhmann et al. 2002): 

- for the detection of trisomy 12 a centromere specific probe for chromosome 
12 

5 - for the detection of 1 1 q deletions probes for the ATM as well as for the RDX 
gene 

- for the detection of 13q deletions probes for the retinoblastoma gene (Rb), 
and the anonymous loci D13S25 and D13S319 

- for the detection of 17p deletion a probe for the p53 gene 

10 - cases with none of the above mentioned aberrations were assigned to the 
group normal 

References: . 

Buhmann R, Kurzeder C, Rehklau J, Westhaus D, Bursch S, Hiddemann W, 
Haferlach T, Hallek M, Schoch C. 

15 CD40L stimulation enhances the ability of conventional metaphase cytogenetics to 
detect chromosome aberrations in B-cell chronic lymphocytic leukaemia cells. 

Br J Haematol 2002 Sep;118(4):968-75 

Schoch C, Schnittger S, Kern W, Lengfelder E, Loftier H, Hiddemann W, Haferlach 
T. 

20 Rapid diagnostic approach to PML-RARalpha-positive acute promyelocyte 
leukemia. 

Hematol J 2002a;3(5):259-63 
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Schoch C, Schnittger S, Bursch S, Gerstner D, Hochhaus A, Berger U, Hehlmann 
R, Hiddemann W, Haferlach T. 

Comparison of chromosome banding analysis, interphase- and hypermetaphase- 
FISH, qualitative and quantitative PCR for diagnosis and for follow-up in chronic 
5 myeloid leukemia: a study on 350 cases, Leukemia 2002b Jan;16(1):53-9 

Schoch C, Haferlach T, Bursch S, Gerstner D, Schnittger S, Dugas M, Kern W, 
Loffler H, Hiddemann W. 

Loss of genetic material is more common than gain in acute myeloid leukemia with 
complex aberrant karyotype: A detailed analysis of 125 cases using conventional 
10 chromosome analysis and fluorescence in situ hybridization including 24-color 
FISH. 

Genes Chromosomes Cancer 2002 Sep;35(1):20-9 

1 i 

Example 7 - Results II: Sample preparation and microarray hybridisation 

Microarray analyses were performed utilising the GeneChip® System (Affymetrix, 
15 Santa Clara, USA). The targets for GeneChip® analyses were prepared according 
to the current Expression Analysis Technical Manual. Briefly, lysates of the 
leukemia samples were homogenised (QIAshredder, Qiagen, Hilden, Germany) 
and total RNA extracted (RNeasy Mini Kit, Qiagen). Normally, 5 pg total RNA 
isolated from 1x10 7 cells were used as starting material in the subsequent cDNA- 
20 synthesis using oligo[(dT) 2 4T7promotor] 6 5 primer (cDNA Synthesis System, Roche 
Applied Science, Mannheim, Germany). The cDNA was purified by 
phenol:chloroform:isoamyl alcohol (25:24:1) extraction (Ambion, Austin, USA) and 
acetate/ethanol precipitated over night. For detection of the hybridised target 
nucleic acid biotin-labeled ribonucleotides were incorporated during the in vitro 
25 transcription (Enzo® BioArray™ HighYield™ RNA Transcript Labeling Kit, ENZO, 
Farmingdale, USA). After quantification of the purified cRNA (RNeasy Mini Kit, 
Qiagen), 15 /yg labeled cRNA were fragmented by alkaline treatment (200 mM 
Tris-acetate, pH 8.2, 500 mM potassium acetate, 150 mM magnesium acetate) 
and added to the hybridisation cocktail sufficient for 5 hybridisations on standard 
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format GeneChip® microarrays. Before hybridisation to HG-U133 microarrays, 
Test3 microarrays (Affymetrix) were chosen in some cases for monitoring the 
integrity of the cRNA. Washing and staining of the probe arrays was performed 
according to the current protocols of the manufacturer (Fluidics Station, 
5 Micro_1v1, EukGE-WS2v4). The Affymetrix software (Microarray Suite, Version 
5.0) extracted fluorescence intensities from each feature on the microarrays as 
detected by confocal laser scanning according to the manufacturers 
recommendations. Some of the hybridization cocktails had previously been 
hybridized to U95Av2 arrays. Hybridization cocktails can be used for up to 5 
1 0 distinct array analyses. 

All hybridisation cocktails demonstrated high quality cRNA characteristics. We 
considered both low 375' ratio (e.g., lower than about 3) of housekeeping controls 
and the total number of present called genes (> about 30% on U133A), along with ' 
the average signal intensity of a present called gene. Expression profiles which 
15 fulfilled all quality control criteria were selected for subsequent supervised 
selection of informative genes. 

Example 7 - Results III: Statistical Analyses 

For data analysis we combined different approaches. First, the expression data 
was preprocessed. Raw expression intensities were scaled using the Affymetrix 

20 Microarray Suite software scaling parameter (target intensity: 5000). This 
preprocessing is based on a mask file which compares expression intensities of a 
set of 100 genes which code for ubiquitous housekeeping cellular proteins. This 
set of genes for normalisation of expression intensities is represented on both 
U133A and U133B arrays. The step of data preprocessing assures that array 

25 experiments can be compared properly using further statistical algorithms and 
methods. Subsequently, the data was analyzed according to two different 
established methods from as described below. The results from the two analyses 
were systematically compared to validate the list of differentially expressed genes. 

1 . Selection of differentially expressed genes 
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a) Analysis according to example 3. 

The top 20 differentially expressed genes were calculated for all disease entities 
and normal bone marrow, respectively, as described in example 3. Expression 
data were analyzed in order to select a minimal set of discriminative genes, which 
5 provides, as described hereinabove (Example 3), maximum classification accuracy 
in leave-one-out-crossvalidation. 

One-versus-all (OVA) and all-pairs comparisons (AP) were systematically applied. 
Genes were ranked according to signal-to-noise ratio (STN). For each OVA and 
AP comparison a set of discriminative genes is disclosed in tables 29, 32, 35, 38 
10 and 41 whereby the gene names can be found in tables 43a,b. The most 
discriminative and informative genes are marked by asterisks in tables 29, 32, 35, 
38 and 41. Classification accuracy was estimated by means of leave-one-out- 
crossvalidation and weighted voting. 

References: 

15 Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, Coller H, 
Loh ML, Downing JR, Caligiuri MA, Bloomfield CD, Lander ES. Molecular 
classification of cancer: class discovery and class prediction by gene expression 
monitoring. Science 1 999; 286(5439):531 -7 

Pomeroy SL, Tamayo P, Gaasenbeek M, Sturla LM, Angelo M, McLaughlin ME, 
20 KimJY, Goumnerova LC, Black PM, Lau C, Allen JC, Zagzag D, Olson JM, Curran 
T,Wetmore C, Biegel JA, Poggio T, Mukherjee S, Rifkin R, Calif ano A, 
StolovitzkyG, Louis DN, Mesirov JP, Lander ES, Golub TR. Prediction of central 
nervous system embryonal tumour outcome based on geneexpression. Nature 
2002; 415(6870):436-42. 

25 2. Estimation of classification accuracy 

A set of 20 top-ranked genes, which provided both optimal classification accuracy 
and highest prediction strength, was selected to avoid overfitting. The significance 
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of each gene was tested by permutation-based neighborhood analysis. The 
robustness of the classifier was assessed by leave-one-out crossvalidation. These 
expression signatures were sufficient to distinguish leukemia samples with high 
accuracies from normal bone marrow and also to predict the recurrent 
5 chromosome aberration, respectively (Tables 29, 32, 35, 38, 41). Accuracy 
denotes the rate of correctly classified test samples. P(g,c) indicates the signal-to- 
noise ratio of gene x: S x = (//i-//2)/(6i+6 2 ), where /j k and 6k denote the mean 
expression and standard deviation of gene x in group k. As calculated from 
pairwise comparisons (class A vs. B), positive P(g,c) values indicate a higher gene 
10 expression in class A, negative P(g,c) values a higher gene expression in class B, 
respectively. 

b)~ Analysis-according-to Westfall &-Young the -same data -set was analysed 
according to Westfall & Young to identify significantly differentially expressed 
genes with adjustment of the p-values for multiple testing. 

15 Step-down maxT and minP multiple testing procedures were applied, which 
compute permutation adjusted p-values for the step-down maxT and minP multiple 
testing procedures, which provide strong control of the family-wise Type I error 
rate (FWER). The multitest package (version 1 .0) from Bioconductor was applied, 
which is based on the R statistical language. These methods outperform other 

20 methods (see Dudoit, JASA 2002). 

References: 

Westfall PH, Young SS (1993) Resampling-based multiple testing: Examples and 
methods for p-value adjustment. John Wiley & Sons. ISBN 0-471-55761-7 

Dudoit S, Fridlyand J, Speed TP. 

25 Comparison of Discrimination Methods for the Classification of Tumors Using 
Gene Expression Data. JASA 2002; 97:77-87 

Package multtest (version 1 .0) 
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from Bioconductor http://www.bioconductor.org 
R statistical language: http://www.r-project.org/ 
c) Comparison of gene lists 

The list of differentially expressed genes obtained from 1a) and 1b) were 
5 systematically compared using PERL scripts in order to identify genes that 
occurred in both list, versus genes occurring in one list only. 

Expression intensities (expression levels) derived from the above-mentioned 
MicroArray Suite program were plotted as bar graphs showing gene expression 
profiles using _a_ Perl scripL(Figures 24 to 464). 

10 References: 

PERL: http://www.perl.com 

Sensitivities for the detection of leukemia types and subtypes were calculated as 
the number of positive samples predicted divided by the number of true positives. 



15 Specificities for the detection of leukemia types and subtypes were calculated as 
the number of negative samples predicted divided by the number of true 
negatives. 

Example 7 - Results IV: Analysis of 14 leukemia subtypes and normal bone 
marrow 

20 Here we analyzed in total 14 distinct leukemia types and subtypes as well a cohort 
of healthy volunteers for normal bone marrow characteristics. We applied the 
described two different statistical methods for identification of genes which allow 
accurate class assignments to the respective groups. 
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ALL t(4;11) (n=9) 

ALLt(8;14) (n=4) 

ALL B not Ph (n=9) 

ALLPh(n=15) 
5 T-ALL (n=9) 

AML+8 (n=10) 

AML complex (n=36) 

AML normal (n=62) 

AMLt(8;21) (n=13) 
10 AMLt(15;17) (n=20) 

AML inv(16) (n=12) 

AMLMLL (n=15) 

CLL (n=32) 

CML (n=14) 
15 normal BM (n=9) 

total: 269 samples 

First, expression data were analyzed according to example 3, as described 
hereinabove. 
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A set of 20 top-ranked genes, which provided both optimal classification accuracy 
and highest prediction strength for all pairwise (all pairs)' and one-versus-all 
comparisons is given as table 29. Within this set of genes, optimal classification 
accuracy can be obtained with genes marked by asterisks. Gene expression 
5 intensities, plotted as bar graphs are given in Figures 24 to 188. Genes are 
depicted as unique Affymetrix identifier (for example 201497_x_at) and, where 
available, approved HGNC symbols (HUGO Gene Nomenclature Committee). 
More detailed, the complete annotation and sequence information about this set of 
genes is listed in tables 43a,b. 

10 In total 269 cases with leukemia or normal bone marrow (BM) were analyzed. 248 
of 269 (92.2%) cases were assigned to the correct leukemia type in all pairwise 
comparisons (table 28 b). The sensitivity indicated for each subgroup indicates 
the percentage of cases of the specific subgroup identified correctly in all pairwise 
comparisons (range 60% to 100%). The specificity indicates for each subgroup the 

15 percentage of correct assignments to this subgroup (range 85.3% to 100%). 

In total 3766 individual assignments of leukemia and normal bone marrow were 
analyzed. 3745 of 3766 assignments (99.4%) were correct (table 28c). The 
sensitivity indicated for each subgroup indicates the percentage of correct 
20 assignments for cases of the specific subgroup in pairwise comparisons, (range 
97.1% to 100%). The specificity indicates for each subgroup the percentage of 
correct assignments to this subgroup (range 98.4% to 100%). 

In a second approach significant genes were identified according to Westfall & 
Young. Table 30 represents all genes found to be significant after p-value 
25 adjustment. Genes are depicted as unique Affymetrix identifier (for example 
201497_x_at) and, where available, approved HGNC symbols (HUGO Gene 
Nomenclature Committee). More detailed, the complete annotation and sequence 
information about this set of genes is listed in table 43a, b. 

Furthermore, we provide information about genes which were found to be rated 
30 significant independently by both methodologies (Table 30). Top-significant genes 
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according to the method of example 3 are marked by asterisks. Genes which were 
included in any of the top-20 lists are marked by positive signs. . 

In addition, selected gene profiles were chosen to demonstrate their capability of 
discriminating .different leukemia types, subtypes and normal bone marrow, 
5 respectively. Gene expression profiles were generated by means of PERL- 
programs, evaluated and plotted as bar graphs. Each of the analyzed groups are 
accordingly outlined. The following genes were selected and are given as Figures 
189 to 233: 



GenelD 


gene symbol 


feature 


201162_at 


IGFBP7 


CLL low 


201163_s_at 


IGFBP7 


CLL low 


201362_at 


NS1-BP 


CML high 


201496_x_at 


MYH11 


AML inv(16) high 


201497_x_at 


MYH11 


AML irw(16) high 


201998_at 


SIAT1 


CLL high 


202095_s_at 


BIRC5 


CLL low 


203074_at 


ANXA8 


AMLt(15;17) high 


204150_at 


STAB1 


AMLt(15;17) high 


20451 1_at 


KIAA0793 


CLL high 


205528_s_at 


CBFA2T1 


AML t(8;21) high 


205529_s_at 


CBFA2T1 


AML t(8;21)high 
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205805_s_at 


ROR1 


CLL high 


206940_s_at 


POU4F1 


AML t(8;21) high 


20781 9_s_at 


ABCB4 


CLL high 


208091_s_at 


DKFZP564K0822 


CLL high 


208456_s_at 


RRAS2 


CLL high 


209061 _at 


NCOA3 


CLL high 


2091 01 _at 


CTGF 


ALL t(4;11) high, 
ALL Ph high, T- 
ALL high 


209374__s__at 


IGHM 


CLL high 


20961 6_s_at 


CES1 


AML MLL high 


210997_at 


hiGF 


AMLt(15;17) high 


212285_s_at 


AGRN 


AMLt(15;17) high 


213539_at 


CD3D 


T-ALL high 


214450_at 


CTSW 


AML t(15;17) high 


215925_s_at 




ALL t(4;11) high 


218223_s_at 


LOC51177 


CML low 


222166__at 




AML +8 high 


224520_s_at 


MGC13168 


ALLt(8;14) high 


224794 s at 


LOC51148 


AMI t/1 *v1 y\ hinh 


225660_at 


SEMA6A 


ALL B not Ph 
high, ALL Ph high 
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one A OC of 


Homo sapiens, Similar to hypothetical protein FLJ2261 1 , clone MGC:2471 6 
IMAGE:4277726. mRNA, complete cds 


ALL high, CLL 
high 


228827__at 


Homo sapiens clone 25023 mRNA sequence 


AMLt(8;21) high 


228904_at 


ESTs 


AML normal high, 
AML +8 high, AML 
complex high 


236301_at 


Homo sapiens, clone IMAGE:3866403, mRNA 


CLL high 


236892_s_at 


HOXB6 


AML normal high, 
AML +8 high, AML 
complex high 


23921 4_at 


ESTs 


ALL t(4;11) high 


239393_at 


ESTs 


ALL t(4;11) high 


23979 1_at 


HOXB6 


AML normal high, 
AML +8 high 


240581_at 


ESTs 


ALL t(4;11) high 


241464_s_at 


ESTs 


AML MLL high, 
AML normal high, 
AML +8 high, AML 
complex high 


<£4! 5<£D_at 


to I S 


AML inv(16) high 


<£4ooo<_s_at 


Lcri 


ALL high, CLL 
high 


36566_at 


CTNS 


T-ALL low 


38487_at 


FU 12442 


AMLt(15;17) high 
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Generally, chromosomal aberrations are strongly associated with morphological 
characteristics. However, there are two chromosomal aberrations which are 
observed in both myeloid and lymphatic neoplasms, i.e. t(11q23)/MLL and the 
t(9;22). The t(9;22) occurs in ALL (ALL Ph) and CIVIL, and t(11q23)/MLL is 
5 observed in ALL (ALL t(,4;11)) and AML (AML MLL), respectively. Analysing gene 
expression signatures of both t(9;22) positive ALL and CML we identified genes, 
which allowed correct lineage assignments (table 29). In addition, our results 
indicate that the distinct expression signatures are also sufficient for correct 
assignments of the t(1 1q23)/MLL positive leukemias either to ALL or to AML (table 
10 29). Thus, in both scenarios lineage assignment (lymphoid or myeloid), and even 
subtype classification can be accomplished based on the methods and markers 
described herein, despite of the fact that e.g., in the above-noted t(11q23) and 
t(9;22) chromosomal aberrations, the same chromosomal aberration is associated 
with different kinds of leukemia. 



Example 7 - Results V: Analysis of 5 ALL subtypes defined by genetics and 



Here we analyzed in 5 distinct ALL subtypes. We applied the described two 
different statistical methods for identification of genes which allow accurate class 
20 assignments to the respective groups. 



immunophenotype 



ALL t(4;11) 



(n=9) 



ALL t(8;14) 



(n=4) 



ALL B not Ph 



(n=9) 



ALL Ph 



(n=15) 



25 T-ALL 



(n=9) 
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First, expression data were analyzed according to example 3, as described 
hereinabove. 

A set of 20 top-ranked genes, which provided both optimal classification accuracy 
and highest prediction strength for all pairwise (all pairs) and one-versus-all 
5 comparisons is given in table 32. Within this set of genes, optimal classification 
accuracy can be obtained with genes marked by asterisks. Gene expression 
intensities, plotted as bar graphs are given in Figures 234 to 252. Genes are 
depicted as unique Affymetrix identifier (for example 201497_x_at) and, where 
available, approved HGNC symbols (HUGO Gene Nomenclature Committee). 
10 More detailed, the complete annotation and sequence information about this set of 
genes is listed in table 43a,b. 

In total 46 cases of ALL were analyzed. 44 of 46 cases (95.7%) were assigned to 
the correct ALL subtype in all pairwise comparisons (table 31a). The sensitivity 
indicated for each subgroup indicates the percentage of cases of the specific 
15 subgroup identified correctly in all pairwise comparisons (range 88.9% to 100%). 
The specificity indicates for each subgroup the percentage of correct assignments 
to this subgroup (range 88.9% to 100%). 

In total 184 individual assignments of ALL were analyzed. 182 of 184 assignments 
(98.9%) were correct (table 31b). The sensitivity indicated for each subgroup 
20 indicates the percentage of correct assignments for cases of the specific subgroup 
in pairwise comparisons, (range 97.2% to 100%). The specificity indicates for each 
subgroup the percentage of correct assignments to this subgroup (range 97.2% to 
100%). 

In a second approach significant genes were identified according to Westfall & 
25 Young. Table 33 represents all genes found to be significant after p-value 
adjustment. Genes are depicted as unique Affymetrix identifier (for example 
201497_x_at) and, where available, approved HGNC symbols (HUGO Gene 
Nomenclature Committee). More detailed, the complete annotation and sequence 
information about this set of genes is listed in table 43a,b. 
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Furthermore, we provide information about genes which were found to be rated 
significant independently by both methodologies (Table 33). Top-significant genes 
according to the method of example 3 hereinabove are marked by asterisks. 
Genes which were included in any of the top-20 lists are marked by positive signs. 

5 In addition, selected gene profiles were chosen to demonstrate their capability of 
discriminating different leukemia types, subtypes and normal bone marrow, 
respectively. Gene expression profiles were generated by means of PERL- 
programs, evaluated and plotted as bar graphs. Each of the analyzed groups are 
accordingly outlined. The following genes were selected and are given as Figures 
10 253 to 271: 



GenelD 


gene symbol 


feature 


201105_at 


LGALS1 


ALLt(4;11) high 


204044__at 


QPRT 


ALL t(4;11) high 


205899_at 


CCNA1 


ALL t(4;11) high 


209168_at 


GPM6B 


ALL t(4;11)high 


213539_at 


CD3D 


T-ALL high 


213894_at 


KIAA0960 


ALL t(4;11) high 


215925_s_at 




ALLt(4;11) high 


218224_at 


PNMA1 


T-ALL high 


219463_at 


C20orf103 


ALL t(4;11)high 


219631_at 


FU12929 


T-ALL high 


225563_at 


ESTs 


ALL t(4;11) high 
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225592_at 


NRM 


ALL t(4;11)high 


228083_at 


Homo sapiens mRNA; cDNA DKFZp434M216 (from clone DKFZp434!1216) 


ALL t(4;11) high 


228988_at 


ZNF6 


T-ALL high 


235749__at 




ALL t(8;14) high 


24241 4_at 


ESTs 


ALL t(4;11) high 


243756_at 


ESTs 


ALL t(4;11) high 



Example 7 - Results VI: Analysis of 8 AML subtypes 



Here we analyzed in total 8 distinct AML subtypes. We applied the described two 
5 different statistical methods for identification of genes which allow accurate class 
assignments to the respective groups. 



trisomy 8 (n=10) 

other aberrant (n=5) 

complex (n=36) 

1 0 normal (n=62) 

t(8;21) (n=13) 

t(15;17) (n=20) 

inv(16) (n=12) 

MLL (n=15) 
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First, expression data were analyzed according to example 3 as described 
hereinabove. 

A set of 20 top-ranked genes, which provided both optimal classification accuracy 
and highest prediction strength for all pairwise (all pairs) and one-versus-all 
5 comparisons is given as table 35. Within this set of genes, optimal classification 
accuracy can be obtained with genes marked by asterisks. Gene expression 
intensities, plotted as bar graphs are given in Figures 272 to 336. Genes are 
depicted as unique Affymetrix identifier (for example 201497_x_at) and, where 
available, approved HGNC symbols (HUGO Gene Nomenclature Committee). 
10 More detailed, the complete annotation and sequence information about this set of 
genes is listed in table 43a,b. 

In total 173 cases of AML were analyzed. 160 of 174 cases (92.5%) were 
assigned to the correct AML subtype in all pairwise comparisons (table 34a). The 
sensitivity indicated for each subgroup indicates the percentage of cases of the 
15 specific subgroup identified correctly in all pairwise comparisons (range 60% to 
100%). The specificity indicates for each subgroup the percentage of correct 
assignments to this subgroup (range 85.5% to 100%). 

In total 1211 individual assignments of AML were analyzed. 1198 of 1211 
assignments (98.9%) were correct (table 34b). The sensitivity indicated for each 
20 subgroup indicates the percentage of correct assignments for cases of the specific 
subgroup in pairwise comparisons (range 94.3% to 100%). The specificity 
indicates for each subgroup the percentage of correct assignments to this 
subgroup (range 97.7% to 100%). 

In a second approach significant genes were identified according to Westfall & 
25 Young. Table 36 represents all genes found to be significant after p-value 
adjustment. Genes are depicted as unique Affymetrix identifier (for example 
201497_x_at) and, where available, approved HGNC symbols (HUGO Gene 
Nomenclature Committee). More detailed, the complete annotation and sequence 
information about this set of genes is listed in table 43a,b. 
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Furthermore, we provide information about genes which were found to be rated 
significant independently by both methodologies (Table 36). Top-significant genes 
according to the method of example 3 are marked by asterisks. Genes which were 
included in any of the top-20 lists are marked by positive signs. 

5 In addition, selected gene profiles were chosen to demonstrate their capability of 
discriminating different leukemia types, subtypes and normal bone marrow, 
respectively. Gene expression profiles were generated by means of PERL- 
programs, evaluated and plotted as bar graphs. Each of the analyzed groups are 
accordingly outlined. The following genes were selected and are given as Figures 
10 337 to 370: 



GenelD 


gene symbol 


feature 


201497_x_at 


MYH11 


AML inv(16) high 


228827_at 


Homo sapiens clone 25023 mRNA sequence 


AMLt(8;21) high 


38487_at 


FLJ12442 


AMLt(15;17) high 


203074_at 


ANXA8 


AML t(15;17) high 


205528_s_at 


CBFA2T1 


AML t(8;21) high 


205529_s_at 


CBFA2T1 


AMLt(8;21) high 


206940__s_at 


POU4F1 


AMLt(8;21) high 


211341_at 


POU4F1 


AMLt(8;21) high 


201496_x_at 


MYH11 


AML inv(16) high 


228660_x_at 


SEMA4F 


other high 


20271 8_at 


IGFBP2 


AML t(15;17) high 
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20538CLat 


PDZK1 


other high 


202746_at 




AML MLL low 


201596_x_at 


KRT18 

i 


AMLt(8;21)low 


3421 0_at 


CDW52 


AMLt(15;17) low 


212850_s_at 


LRP4 


AML inv(16) high 


228904_at 


ESTs 


AMI t/ft*91\ I/ma/ 
nWIL l\O t tZ\) IOW, 

AML t(15;17) low, 
AML inv(16) low, 
AML MLL low 


203151__at 


MAP1A 


AMLt(8;21) low 


201137_s_at 


HLA-DPB1 


AML t(15;17) low 


200675_at 


CD81 


AML inv(16) low 


201425_at 


ALDH2 


AMLt(8;21) low 


202085_at 


TJP2 


AML inv(16) low 


20261 9_s_at 


PLOD2 


AML MLL low 


203092_at 


TIMM44 


AML lnv(16) low 


204425_at 


ARHGAP4 


AMLt(15;17) low 


205366_s_at 


HOXB6 


AML t(15;17) low, 
AML inv(16) low, 
AML MLL low 


205472_s_at 


DACH 


*ML MLL high 


206761 _at 


TACTILE " ; 


*ML MLL low 
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222166_at 




AML +8 low 


222335_at 


ESTs 


AML MLL low 


22331 8_s_at 


MGC10974 


AML complex low 


225330_at 


Homo sapiens, clone MGC:18216 IMAGE:41 56235, mRNA, complete cds 


AML inv(16) low 


231277_x_at 


ESTs 


AML complex low 


635_s_at 


PPP2R5B 


other low 



Example 7 - Results VII: Analysis of 5 genetically defined CLL subtypes 

Here we analyzed in total 5 genetically defined CLL subtypes. We applied the 
described two different statistical methods for identification of genes which allow 
5 accurate class assignments to the respective groups. 

trisomy 12 (n=5) 

11q- (n=4) 

13q- (n=10) 

17p- (n=4) 

10 normal (n=9) 

First, expression data were analyzed according to example 3 as described 
hereinabove. 

A set of 20 top-ranked genes, which provided both optimal classification accuracy 
and highest prediction strength for all pairwise (all pairs) and one-versus-all 
15 comparisons is given as table 38. Within this set of genes, optimal classification 
accuracy can be obtained with genes marked by asterisks. Gene expression 
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intensities, plotted as bar graphs are given in Figures 371 to 404. Genes are 
depicted as unique Affymetrix identifier (for example 201497_x_at) and, where 
available, approved HGNC symbols (HUGO Gene Nomenclature Committee). 
More detailed, the complete annotation and sequence information about this set of 
5 genes is listed in table 43a,b. 

In total 32 cases of CLL were analyzed. 31 of 32 cases (96.9%) were assigned to 
the correct CLL subtype in all pairwise comparisons (table 37a). The sensitivity 
indicated for each subgroup indicates the percentage of cases of the specific 
subgroup identified correctly in all pairwise comparisons (range 90% to 100%). 
10 The specificity indicates for each subgroup the percentage of correct assignments 
to this subgroup (range 90% to 100%). 

In total 128 individual assignments of CLL were analyzed. 127 of 128 assignments 
(99.2%) were correct (table 37b). The sensitivity indicated for each subgroup 
indicates the percentage of correct assignments for cases of the specific subgroup 
15 in pairwise comparisons (range 97.5% to 100%). The specificity indicates for each 
subgroup the percentage of correct assignments to this subgroup (range 97.3% to 
100%). 

In a second approach significant genes were identified according to Westfall & 
Young. Table 39 represents all genes found to be significant after p-value 
20 adjustment. Genes are depicted as unique Affymetrix identifier (for example 
201497_x_at) and, where available, approved HGNC symbols (HUGO Gene 
Nomenclature Committee). More detailed, the complete annotation and sequence 
information about this set of genes is listed in table 43a,b. 

Furthermore, we provide information about genes which were found to be rated 
25 significant independently by both methodologies (Table 39). Top-significant genes 
according to the method of example 3 are marked by asterisks. Genes which were 
included in any of the top-20 lists are marked by positive signs. 

Example 7 - Results VIII: Analysis of the four major leukemia types (ALL, AML, 
CLL, CML) and normal bone marrow 
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Here we analyzed in total 4 . major leukemia types as well a cohort of healthy 
volunteers for normal bone marrow characteristics. We applied the described two 
different statistical methods for identification of genes which allow accurate class 
assignments to the respective groups. 

5 ALL (n=47) 

AML (n=175) 

CLL (n=35) 

CML(n=14) 

Normal bone marrow (n=9) 

10 First, expression data were analyzed according to example 3 as described 
hereinabove. 

A set of 20 top-ranked genes, which provided both optimal classification accuracy 
and highest prediction strength for all pairwise (all pairs) and one-versus-all 
comparisons is given as table 41 . Within this set of genes, optimal classification 

15 accuracy can be obtained with genes marked by asterisks. Gene expression 
intensities, plotted as bar graphs are given in Figures 405 to 431. Genes are 
depicted as unique Affymetrix identifier (for example 201497_x_at) and, where 
available, approved HGNC symbols (HUGO Gene Nomenclature Committee). 
More detailed, the complete annotation and sequence information about this set of 

20 genes is listed in table 43a,b. 

In total 280 cases of leukemia and normal bone marrow (BM) were analyzed. 263 
of 280 cases (93.9%) were assigned to the correct leukemia subtype or normal 
bone marrow in all pairwise comparisons (table 40a). The sensitivity indicated for 
each subgroup indicates the percentage of cases of the specific subgroup 
25 identified correctly in all pairwise comparisons (range 76.6% to 98.3%). The 
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specificity indicates for each subgroup the percentage of correct assignments to 
this subgroup (range 88.9% to 97.1%). 

In total 1120 individual assignments of leukemia subtype or normal bone marrow 
were analyzed. 1103 of 1120 assignments (98.5%) were correct (table 40b). The 
5 sensitivity indicated for each subgroup indicates the percentage of correct 
assignments for cases of the specific subgroup in pairwise comparisons (range 
94.2% to 99.3%). The specificity indicates for each subgroup the percentage of 
correct assignments to this subgroup (range 97.2% to 99.3%). 

In a second approach significant genes were identified according to Westfall & 
10 Young. Table 42 represents all genes found to be significant after p-value 
adjustment. Genes are depicted as unique Affymetrix identifier (for example 
201497_x_at) and, where available, approved HGNC symbols (HUGO Gene 
Nomenclature Committee). More detailed, the complete annotation and sequence 
information about this set of genes is listed in table 43a,b. • 

15 Furthermore, we provide information about genes which were found to be rated 
significant independently by both methodologies (Table 42). Top-significant genes 
according to the method of example 3 are marked by asterisks. Genes which were 
included in any of the top-20 lists are marked by positive signs. 

In addition, selected gene profiles were chosen to demonstrate their capability of 
20 discriminating different leukemia types, subtypes and normal bone marrow, 
respectively. Gene expression profiles were generated by means of PERL- 
programs, evaluated and plotted as bar graphs. Each of the analyzed groups are 
accordingly outlined. The following genes were selected and are given as Figures 
432 to 464: 

25 



GenelD 


gene symbol 


feature 


202503_s_at 


KIAA0101 


CLL low 
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202580_x_at 


FOXM1 


CLL low 


202709_at 


FMOD 


CLL high 


204882_at 


KIAA0053 


CLL high 


205049_s_at 


CD79A 


ALL high, CLL 
high 


205051 _s_at 


KIT 


AML high 


205382_s_at 


DF 


AML high 


205599_at 


TRAF1 


CML low CLL high 


206255_at 


BLK 


ALL high, CLL 
high 


206398_s_at 


CD19 


ALL high, CLL 
high 


210487_at 


DNTT 


ALL high 


210948_s_at 


LEF1 


ALL high, CLL 
high 


21l352_s_at 


NCOA3 

i 


CLL high 


211404_s_at 


APLP2 


AML high 


214761_at 


OA2 


ALL high 


217950_at 


NOSIP 


CLL high 


218090_s_at 




CLL high 


218516_s_at 


FU20421 


normal BM low 


218916_at 


FU23436 


normal BM low 
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219753_at 


STAG 3 


ALL high 


221969_at 


PAX5 


ALL high, CLL 
high 


223703_at 


CDA017 


AML high, CML 

high, normal BM 
• 

high 


226147_s_at 


Homo sapiens cDNA: FU22667 fis, clone HSI08385 


CLL high 


22847 1_at 


ESTs 


CLL high 


229487_at 


ESTs 


ALL high 


229790_at 


TERF2 


CML low, BM low 


231736_x_at 


MGST1 

• 


AML high, CML 
high, normal BM 
high 


231854_at 


Homo sapiens cDNA FU1 1448 fis, clone HEMBA1001391 


CML low 


239287_at 


ESTs 


CLL high 


243362_s_at 


LEF1 


ALL high 


243363_at 


LEF1 


ALL high, CLL 
high 


41577_at 


PPP1R16B 


CML low 



Tables 43a, b: functional gene annotation for genes identified to be differentially 
expressed between different types of leukemia, or between healthy bone marrow 
and leukemia, respectively. 

5 As described by the GeneChip manufacturer, for each probeset (for example 
200093_s__at_HG-U133A), a GenBank or RefSeq accession number was chosen 



BNSDOCID: <WO 03039443A2J_> 



WO 03/039443 



PCT/EP02/12303 



140 

to represent the target sequence. Using this accession number, a UniGene cluster 
(in current release) was identified where the accession number was used. If there 
is a link to LocusLink in the UniGene record, then annotations were retrieved from 
LocusLink. Those annotations include gene symbol, location, OMIM, EC, Gene 
5 Ontology (GO), description and RefSeq sequence accession. The RefSeq 
accession was linked to the protein annotations, which include domain 
identification (Pfam and BLOCKS), similarity search (blastp nr) and family 
classification (SCOP, EC and GPCR HMM searches). 



10 Target sequence information for all the probes which were identified to be able to 
distinguish between different types and subtypes of leukemia and normal bone 
marrow, respectively, are given in Table 44. 

As further described by the GeneChip manufacturer, the HG-U133 Target 
Databank is a compilation of probe set annotations and target sequence 
15 information for all the probes represented on the HG-U133 A and B arrays. Target 
sequences are the relatively short (typically around 300-600 bp) sequences 
against which probes have been designed on a GeneChip® array. These target 
sequences can be thought of as a subsequence of the Consensus/Exemplar 
sequence. 

> 

20 The Consensus/Exemplar sequences (i.e., the coding or full cDNA sequences 
corresponding to the markers described herein as being able to distinguish 
between different types and subtypes of leukemia and normal bone marrow) for 
most markers are given in Table 45. 

Example 7 Conclusions 

25 The expression pattern of genes allowed precise class assignments of defined 
leukemia types and subtypes according to the WHO classification of hematological 
malignancies, and normal BM, respectively. 
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Thus, we introduce candidate genes suitable for diagnosis of leukemia types and 
subtypes based on gene expression profiling. 

These data demonstrate the utility of gene expression profiling for the 
discrimination of all leukemia major entities and most subentities. In total, up to 14 
5 different leukemia types and subtypes could clearly be distinguished from each 
other and from normal BM, respectively. These leukemias are associated with 
highly differing prognoses and require specific treatment strategies. By performing 
these analyses on a single platform requiring basic molecular biological methods, 
this approach provides a broad access to high-quality diagnosis of leukemia. 
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Golub 




invention 








A- 


samples: 18/85 








A - 


samples: 18 / 85 






accuracy 


0,87 








accuracy 


0,96 






confidence 


0,77 








confidence 


0,88 






failed 


6, 19,22,26,78,79,80,8 1 ,82,83,84,85,99 




failed 


5,6,19,22 






gene 


signal-to-noise 


P. 


decision limit 




gene 


signal-to-noise 


P 


decision limit 


gl 


-1,14 


0* 


482,01 




gl 


-1,14 


0 




g2 


-1,06 


0* 


192,17 






-1,06 


0* 


98,50 


g3 


-0,97 


0* 


207,67 




g3 


-0,97 


0 




g4 


0,94 


0* 


205,05 




g4 


0,94 


0 




g5 


-0,93 


o* 


1818,11 




g5 


-0,93 


0 




g6 


0,93 


0* 


451,74 




g6 


0,93 


0 




g7 


-0,91 


0* 


23,84 




g7 


-0,91 


0 




g8 


-0,90 


0* 


225,72 




g8 


-0,90 


0 






0,90 


0* 


43,85 




g9 


0,90 


0 




gio 


0,89 


0* 


210,78 




glO 


0,89 


0 




gll 


-0,88 


0* 


118,63 




gll 


-0,88 


0 




gl2 


0,87 


0* 


55,39 




gl2 


0,87 


0* 


67,80 
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gl3 


0,87 


0* 


127,15 


gl3 


0,87 


0* 


164,10 


gl4 


0,86 


0* 


222,04 j 


gl4 


0,86 


0 




gl5 


0,85 


0* 


68,52 


gl5 


0,85 


0 




gl6 


-0,85 


0* 


546,97 


gl6 


-0,85 


0 




gl7 


0,84 


0* 


1242,77 


g!7 


0,84 


0 




gl8 


-0,84 


0* 


162,61 


gl8 


-0,84 


0 




gl9 


-0,83 


o* 


385,39 


gl9 


-0,83 


0 




g20 


0,46 


o* 


105,38 


g20 


0,46 


0 





Table A. Analysis of 18 samples class A versus 85 samples class non-A. On the 
left the analysis according to Golub is presented for 20 informative genes. The 
crossvalidation accuracy is 0,87, confidence 0,77. Samples, where crossvalidation 
5 failed, are listed. For each gene signal to noise ratio, p-value (significance 
obtained from permutation test) and decision limit are provided. On the right the 
same data set is analyzed using the protocol of the invention. By selection of 3 
genes (marked with asterisks) out of the top 20 genes and selecting optimized 
decision limits, the crossvalidation accuracy reaches 0,96, confidence 0,88. 
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Table 20 

1. Analyse: major leukemia 
marrow 

ALL - AML 

accuracy 
confidence 

1 41220_at 

2 36996_at 

3 33944_at 

4 41809_at 

5 39062_at 

6 36021 _at 

7 40282_s_at 

8 39801 _at 

9 41808_at 

10 38791 at 



types versus normal bone 

samples: 18 / 59 pValue selected 
0.974025974025974 
0.949191799863432 

1.43161207339479 0 

-1.41523985920205 0 

-1.36856428236618 0 

1.34726978852919 0 

-1.32477468024042 0 

1 .29482788383042 0 

-1 .25276275727203 0 

-1.18216117554755 0 

1.16535104461878 0 

-1.16212420300011 0 



WO 03/039443 

11 38705_at 

12 38017_at 

13 38233_at 

14 4008 1_at 

15 33414_at 

16 36644_at 

17 1497_at 

18 34670_at 

19 39689_at 

20 36553_at 

ALL - BM 

accuracy 
confidence 

1 32775_r_at 

2 38858_at 

3 33860_at 

4 33944 at 
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1.10578452683281 0 

1.09519463190211 0 

-1.08734958364712 0 

-1 .06895950257537 0 

-1.06893939139052 0 

-1.06866972901421 0 

-1.05371604908866 0 

-1.0471297974693 0 

-1.02349800799274 0 

0.461827371901751 0 

samples: 18/8 
1 

0.973577941615687 
-1.99123472883631 0 
-1.89835994048167 0 
-1.8349786493313 0 
-1.79527299060519 0 



WO 03/039443 

5 32800_at 

6 35204_at 

7 38112_g_at 

8 38735_at 

9 137_at 

10 1495_at 

11 36661_s_at 

12 38225_at 

13 39860_at 

14 32166_at 

15 32530_at 

16 35355_at 

17 1529_at 

18 36790_at 

19 2045_s_at 

20 36553_at 

ALL - CLL 



PCT/EP02/12303 

659 

-1.78206927960542 0 

-1.77698316964481 0 

-1.75527325798005 0 

-1.75431905717345 0 

-1.75261998994426 0 

-1.73006006400362 0 

-1.70636382014738 0 

-1.66016477586249 0 

1.65854625573936 0 

-1.65691236756089 0 

1.64232673980553 0 

1.62316234982832 0 

-1.60938224689727 0 

-1.6093508135706 0 

-1.59880026139776 0 

0.997704664996536 0 

samples: 18/8 
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accuracy 
confidence 

1 484_at 

2 38577_at 

3 2019_s_at 

4 3381 2_at 

5 34663_at 

6 36894_at 

7 39670_at 

8 41660_at 

9 39165_at 

10 31870_at 

11 34871_at 

12 34830_at 

13 31936_s_at 

14 1062_g_at 

15 41847_at 

16 1217_g_at 



8NSDOCID: <WO 03039443A2J_> 



PCT/EP02/123O3 

660 

1 

1 

-2.96991037890552 0 

-2.76720679743789 0 

-2.66997144118244 0 

-2.65004620998946 0 

-2.56576070575816 0 

-2.56382539311197 0 

-2.48013356223836 0 

-2.4799018378336 0 

-2.3871739157192 0 

-2.36838597731039 0 

-2.36033294682702 0 

-2.33471776306134 0 

-2.25951360532653 0 

-2.25550068155602 0 

-2.23298915825072 0 

-2.21920314115838 0 



WO 03/039443 



PCT/EP02/ 12303 



17 1529_at 

18 41796_at 

19 32597_at 

20 33266 at 



-2.17991363072808 



-2.17392867117507 



-2.13552223797253 



1.44232706973843 



ALL - CML 

accuracy 
confidence 

1 36809_at 

2 3731 1_at 

3 36766_at 

4 38894_g_at 

5 39179_at 

6 38893_at 

7 37897_s_at 

8 41809_at 

9 36963_at 

10 39301 at 



samples: 18/10 
1 



-2.79788870256583 

-2.20009414203519 

-2.15356495523503 

-2.11314073543331 

-2:08890598787237 

-2.05723533682216 

-2.05026870146261 

1 .98458593845403 

-1.95232400595449 

-1.91549367394028 



0 
0 



661 



BNSDOCID: <WO 03039443A2 J_> 



WO 03/039443 

11 4061 0_at 

12 38879_at 

13 41338_at 

14 39968_at 

15 33371 _s_at 

16 37149_s_at 

17 38895_i_at 

18 41220_at 

19 37099_at 

20 40159_r_at 

AML - BM 

accuracy 
confidence 

1 3671 0_at 

2 32775_r_at 

3 3282 1_at 

4 37149_s_at 

BNSOOCID: <WO 03039443A2J_> 



PCT/EP02/12303 

1.91246286924336 0 

-1.87541355348469 0 

1.86506063801814 0 

-1.81415292791782 0 

-1.81111388769192 0 

-1.77170759525375 0 

-1.77027078711718 0 

1 .75387842844952 0 

-1.74839569592051 0 

-1.35163593608562 0 

samples: 59/8 
1 

0.995997161972555 

-2.29217042550277 0 

-2.24397275344625 0 

-1.98392215005915 0 

-1 .94686927462724 0 

662 



WO 03/039443 

5 3701 5_at 

6 36894_at 

7 38735_at 

8 33752_at 

9 34654_at 

10 1115_at 

11 31859_at 

12 1980_s_at 

13 36464_at 

14 38858_at 

15 38225_at 

16 39170_at 

17 39929_at 

18 36021 _at 

19 32259_at 

20 41138_at 

AML - CLL 



PCT/EP02/12303 

-1.89128914250756 0 

-1.80507021485339 0 

-1 .78884482794867 0 

-1.77319451495748 0 

-1 .67560279229506 0 

-1.62050106692579 0 

-1.52416410078922 0 

1.51765867172316 0 

-1 .49540428238676 0 

-1.48511074361835 0 

-1.4790110074487 0 

-1.45392862675606 0 

-1.44420537588163 0 

-1.42322337311917 0 

-1.41395952112425 0 

1.25347154740786 0 

samples: 59/8 

663 



BNSDOCID: <WO 03039443A2_I_> 



WO 03/(139443 

accuracy 
confidence 

1 36239_at 

2 41220_at 

3 1096_g_at 

4 36155_at 
1 5 38578_at 

6 34871_at 

7 38006_at 

8 41165_g_at 

9 1105_s_at 

10 41166_at 

^ 11 31936_s_at 

12 3381 2_at 

13 41796_at 

14 36894_at 

15 38577_at 

16 38666_at 

BMSDOCID: <WO 03039443A2J_> 



PCT/EP02/12303 

1 

1 

-3.31028543322741 0 

-3.2480863078754 0 

-3.1269759462136 0 

-3.08191178811872 0 

-3.06174627261543 0 * 

-3.02230826662657 0 

-2.88939117591885 0 

-2.87065891259428 0 

-2.81674515012354 0 

-2.75359060509193 0 

-2.7527757757001 1 0 

i 

-2.74332185979714 0 

-2.71763998725163 0 

-2.64049311405919 0 

-2.63072228466709 0 

-2.61485585331331 0 

664 



WO 03/039443 PCT/EP02/12303 

17 32597_at -2.61456952032068 0 

18 41847_at -2.59409998700552 0 

19 34830_at -2.5698161906968 0 

20 33266 at 1.336807541194 0 



AML - CML 

accuracy 
confidence 

1 36464_at 

2 32821 _at 

3 31859_at 

4 37149_s_at 

5 3671 0_at 

6 34546_at 

7 33530_at 

8 3591 9_at 

9 37099_at 

10 36165 at 



samples: 59/10 
1 

0.956293899622379 

-2.85754269919935 

-2.55228282645443 

-2.27761090295808 

-2.11494842606363 

-1.94749857224656 

-1.85596431666606 

-1.82614283985583 

-1.74217437339032 

-1.7184463713752 

1.7091639497163 



665 



BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 

11 37054_at 

12 31381_at 

13 37579_at 

14 33371 _s_at 

15 1117_at 

16 38894_g_at 

17 31793_at 

18 4041 9_at 

19 37926_at 

20 40159_r_at 

BM - CLL 

accuracy 
confidence 

1 39070_at 

2 37755_at 

3 33963_at 

4 38415 at 



PCT/EP02/12303 

-1.69705232668157 0 

-1 .6873086898061 0 

-1.68217107035442 0 

-1 .67508882502583 0 

-1.67009743506797 0 

-1.66951302748224 0 

-1.63477764661912 0 

-1.62639286532631 0 

-1.59510369191926 0 

i 

-0.830070056055723 0 

samples: 8/8 

1 

1 

6.29661458968093 0 * 

4.71476584328837 0 

4.63206684324173 0 

-4.49254544394577 0 

666 



03039443A2 I > 



WO 03/039443 



PCT/EP02/12303 



5 36123_at 

6 37615_at 

7 38116_at 

8 3841 4_at 

9 41220_at 

10 34871 _at 

11 35643_at 

12 1096_g_at 

13 33386_at 

14 1884_s_at 

15 37149 s at 



16 
17 
18 
19 
20 



820_at 
38269_at 
35995_at 
33358_at 
33284 at 



4.12198883271914 0 



3.82297014835908 0 



3.70165567234484 0 



3.60409203763551 0 



-3.59882846329979 0 



-3.59319336097498 0 



3.56571508955085 0 



-3.51858708122275 0 



3.50471765190995 0 



3.48564292772594 0 



3.4769522926405 0 



3.37687376127416 0 



-3.31356010895055 0 



3.26267624054277 0 



3.25001188548107 0 



3.23482135945872 0 



BM -CML 



samples: 8/10 



667 



BNSDOCID: <WO 03039443A2J_> 



WO 03/039443 

accuracy 
confidence 

1 37625_at 

2 41609_at 

3 36661 _s_at 

4 1911_s_at 

5 854_at 

6 36773_f_at 

7 3501 6_at 

8 33274_f_at 

9 38112_g_at 

10 38194_s_at 

11 41165_g_at 

12 33273_f_at 

13 39179_at 

14 432_s_at 

15 36588_at 

16 39968_at 

BNSDOCID: <WO 03039443A2_I_> 



PCT/EP02/12303 w ^ 

1 

1 

2.87622426554922 0 

2.52321020501761 0 

2.51057654386246 0 

-2.42145340446397 0 

2.37696256335487 0 

2.32488494287137 0 

2.24094140253387 0 

2.21051775352204 0 

2.13305817504128 0 

2.09822821859324 0 

2.09291822272078 0 

2.0687446585274 0 

-2.04974106118371 0 

2.01424464352775 0 

1 .99886555057795 0 

-1.98085064661371 0 

668 



WO (13/039443 

17 1385_at 

18 36629_at 

19 38728_at 

20 38472_at 

CLL - CML 

accuracy 
confidence 

1 1105_s_at 

2 41609_at 

3 1096_g_at 

4 3421 0_at 

5 36155_at 

6 36766_at 

7 41220_at 

8 41165 _g_at 

9 37625_at 

10 37027 at 



PCT/EP(I2/12303 

1 .98022588084225 0 

-1.9763991250365 0 

1 .95859957483225 0 

1.95627106051459 0 

samples: 8/10 

1 

1 

6.65345823459692 0 

5.09272743129851 0 

4.79791769602114 0 

4.5885400157468 0 

4.51821220572632 0 

-4.38087516961473 0 

4.30356291392085 0 

4.29933289075313 0 

4.27214024229386 0 

4.17726581707744 0 



669 



WO ((3/039443 



PCT/EP02/12303 



11 34871 _at 

12 38095_i_at 

13 38578_at 

14 38116_at 

15 35643_at 

16 38833_at 

17 41164_at 

18 37344_at 

19 39670_at 

20 35016 at 



4.11725674890371 0 

4.01294758950756 0 

3.96024474623017 0 

-3.93637939332745 0 

-3.90694033464584 0 

3.90073371467641 0 

3.89237729890143 0 

3.8687581898534 0 

3.86448376068684 0 

3.74007150430317 0 



1 

2 



2. Analyse: analysis of AML subtypes according to the WHO 
nomenclature 



AMLMLL - 

accuracy 
confidence 
34306_at 
36881 at 



samples:- 10 / 45 



0.92125170098711 



1.36682833853864 0 



1.25743716610113 0 



670 



BNSDOCID: <WO 03039443A2J_> 



WO 03/039443 

3 38994_at 

4 38485_at 

5 32696_at 

6 1983_at 

7 37026_at 

8 138_at 

9 3881 2_at 

10 33284_at 

11 32232_at 

12 3992 1_at 

13 34679_at 

14 37992_s_at 

15 37029_at 

16 40775_at 

17 36709_at 

18 37809_at 

19 38097_at 

20 36608_at 

BNSDOCID: <WO 03039443A2_I_> 



PCT/EP02/12303 

1.25633105431216 0 

1.20820491820515 0 

1 .20289326580336 0 

-1.20116171703008 0 

-1.18461413291823 0 

1.18310205413783 0 

-1.17846157492535 0 

-1.16681898560395 0 

1.14845507137154 0 

i 

1.13232410733091 0 

-1.12714040987389 0 

1.11986637618528 0 

1.06646924971963 0 

-1.06615341562387 0 

1.04614233632581 0 

1.03749230715704 0 

1.03525254247508 0 
0.747544727295107 0 

671 



WO 03/039443 

AMLMLL - INV16 

accuracy 
confidence 

1 38812_at 

2 37407_s_at 

3 35282_r_at 

4 37026_at 

5 33856_at 

6 32174_at 

7 33284_at 

8 38653_at 

9 1983_at 

10 32696_at 

11 767_at 

12 35329_at 

13 3688 1_at 

14 40767 at 



PCT/EP02/12303 

samples: 10/10 

1 

1 

-3.34686451971904 0 

-3.22294767554078 0 

2.34298696520172 0 

-2.25660818336648 0 

2.25212063750729 0 

-2.16867472363265 0 

-2.14901777919516 0 

-2.09296931036988 0 

-2.06674088426528 0 

2.03331671439074 0 

-1.99007511677258 0 

1.92663715318122 0 

1.88821561232545 0 

-1.84600140068058 0 

672 



03039443A2J_> 



WO 03/039443 

15 36095_at 

16 538_at 

17 3331 9_at 

18 38485_at 

19 38747_at 

20 38994_at 

AMLMLL - OTHER 

accuracy 
confidence 

1 36980_at 

2 100_g_at 
i 3 38994_at 

4 37029_at 

5 37026_at 

6 3741 7_at 

7 39993_at 

8 39118_at 

BNSDOCID: <WO 03039443A2J_> 



-1.77023425026019 
-1 .75720288873792 
1 .6982626829354 
1.69464482881744 
-1.69240449076905 
1.50803351291881 



PCT/EP02/12303 

0 
0 
0 
0 
0 
0 



samples: 10/10 
7 

0.972144217378764 

-1.34035598763443 0 

1.32781895440119 0 

1.27848227020726 0 

1.27656800999718 0 

-1.24955259337174 0 

1.19110502379759 0 

-1.18099046398082 0 

-1.1584453284446 0 

673 



WO 03/039443 



PCT/EP02/12303 



9 36881 _at 

10 3425 1_at 

11 3581 3_at 

12 138_at 

13 36945_at 

14 40281_at 

15 35941_f_at 

16 36952_at 

17 37403_at 

18 33689_s_at 

19 35372_r_at 

20 32072 at 



1.12737149627183 0 
-1.12590878042921 0 
1.10960381779872 0 
1.09814796011793 0 
1.09593061163621 0 
1 .083400522626 0 
1.0833686449051 0 
1 .069544205786 0 
-1.06943971961994 0 
1 .06337639894231 0 
-1.05302441823616 0 
1 .02664886940357 0 



1 
2 



AMLMLL - T1517 

accuracy 
confidence 
39649_at 
38435 at 



samples: 10/16 
1 



3.23957503042803 0 



2.60733219271303 0 



674 



BNSOOCID: <WO 03039443A2_I_> 



WO ((3/039443 



PCT/EP02/ 12303 



3 38097_at 

4 32229_at 

5 38487_at 

6 32696_at 

7 38485_at 

8 33284_at 

9 38824_at 

10 41138 at 



11 
12 
13 
14 
15 



37967_at 
33866_at 
32543_at 
1983_at 
35823 at 



16 36749_at 

17 38063_at 

18 39814_s_at 

19 36843_at 

20 38992 at 



2.52025252941371 0 



2.40682119042641 0 



-2.31348932548076 0 



2.21878031159682 0 



2.10692284528305 0 



-2.09299435714406 0 



2.08124417022995 0 



-2.03312026813146 0 



1.92254422115649 0 



-1.91543446589406 0 



-1.87715920226866 0 



-1.87444715294745 0 



-1.86189545486519 0 



-1.81025193870165 0 



-1 .7878938995328 0 



•1.75490742013487 0 



1.74046122003025 0 



1.71753799928796 0 



675 



BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



AMLMLL - T821 



samples: 10/9 



accuracy 
confidence 

1 36881_at 

2 32323_at 

3 3781 1_at 

4 38391_at 

5 3341 2_at 

6 33284_at 

7 33856_at 

8 38097_at 

9 34679_at 

10 37399_at 

11 3657 1_at 

12 35638_at 

13 32696_at 

14 32184 at 



2.29294544811647 0 

-1.98347658852059 0 

-1.98247325351143 0 

1 .94083259845207 0 

1.92827460657744 0 

-1.8385557317965 0 

1.77909423724864 0 

1.75444250975416 0 

-1.72476579402037 0 

-1.70947276971912 0 

1.65482279043264 0 

-1.65089605723885 0 

1.63638794888669 0 

1.62897781786406 0 



676 



BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 

15 1911_s_at 

16 34306_at 

17 138_at 

18 41694_at 

19 32232_at 

20 36608_at 

INV16 - 

accuracy 
confidence 

1 37407_s_at 

2 767_at 

3 245_at 

4 35282_r_at 

5 38465_at 

6 36095_at 

7 32174_at 

8 1385_at 

SNSOOCID: <WO 03039443A2_I_> 



PCT/EP02/12303 

-1.61083786198679 0 

1.59626133274337 0 

1.59241136884495 0 

-1.55909099909815 0 

1.54494819348846 0 

1.1358211663482 0 

samples: 10/45 

1 

1 

3.02509409963287 0 

1.85632628490303 0 

1.70794453836984 0 

-1.55233894025198 0 

1.52686366669143 0 

1.40576248502182 0 

1.39467340729953 0 

1.35491176363704 0 

677 



WO 03/039443 

9 41609_at 

10 36607_at 

11 34210_at 

12 33731_at 

13 2019_s_at 

14 40456_at 

15 277_at 

16 931 _at 

17 37762_at 

18 35230_at 

19 34780_at 

20 41200_at 

INV16- OTHER 

accuracy 
confidence 

1 37407_s_at 

2 37600_at 

_03039443A2_I_> 



PCT/EP02/12303 

1.33680396130546 0 

1.31728883029627 0 

1.25533053163606 0 

1.18379724417068 0 

1.1724338503802 0 

-1.17211851173852 0 

1.16888798816433 0 

1.16565119574672 0 

1.14260401763247 0 

1.13832458283537 0 

1.12142169279465 0 

1.10294533672324 0 

samples: 10/10 
1 
1 

3.18736190788495 
2.62690062253569 

678 



0 
0 



WO 03/039443 

3 767_at 

4 41609_at 

5 32174_at 

6 41723_s_at 

7 38833_at 

8 38465_at 

9 38095_i_at 

10 1230_g_at 

11 1252_at 

12 32434_at 

13 40856_at 

14 37762_at 

15 37344_at 

16 3501 6_at 

17 35078_at 

18 37001 _at 

19 33920_at 

20 245_at 



PCT/EP02/12303 

2.03572561609575 0 
1.96219869102861 0 
1.91764639873215 0 
1.8341348306376 0 
1.8197922525705 0 
1.80863491318623 0 
1.80209304301982 0 
1.64339686952341 0 
1.59751061992943 0 
-1.5885630305188 0 
1 .50737403239702 0 

1.49715599199852 0 

1.46932515754743 0 

1.44723563131209 0 

1.44013020473382 0 

1.43580114615375 0 

1.39389220008214 0 

1.37591797605968 0 

679 



WO ((3/039443 



PCT/EP02/12303 



INV16 -T1517 

accuracy 
confidence 

1 245_at 

2 38833_at 

1 3 38095_i_at 

4 39649_at 

5 38096_f_at 

6 3501 6_at 

7 38435_at 

8 37039_at 
y 9 38465_at 

10 37407_s_at 

11 3421 0_at 

12 41723_s_at 

13 41471_at 

14 34789 at 



samples: 10/16 



4.57658935261639 0 

4.25223366871621 0 

3.57578878481709 0 

3.23025693895729 0 

3.14225313100266 0 

3.0787769409051 0 

3.03350943051849 0 

2.97439961013438 0 

2.96526060073085 0 

i 

2.87881711895892 0 

2.84796613729927 0 

2.82100515486823 0 

2.80737102015788 0 

2.75455608102168 0 



680 



BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



15 
16 

17 
18 
19 
20 



1052_s_at 
3660 1_at 
41096_at 
41609_at 
37344_at 
40698 at 



2.6975735697327 0 



2.67187833778929 0 



2.59449996104956 0 



2.57402348086536 0 



2.56147220347162 0 



2.54689030859799 0 



INV16 -T821 

accuracy 
confidence 

1 37407_s_at 

2 2019_s_at 

3 36607_at 

4 41535_at 

5 40198_at 

6 35264_at 

7 36661 _s_at 

8 36095 at 



samples: 10/9 

7 

1 

3.01984756968935 0 

2.27319677276044 0 

2.26121735205867 0 

2.20248916475367 0 

1.86811562002606 0 

1.84583120098715 0 

1.8131267115673 0 

1.79878279442616 0 



681 



BNSDOCID: <WO 03039443A2J_> 



WO 03/039443 

9 32747_at 

10 4071 8_at 

11 37326_at 

12 34780_at 

13 3961 0_at 

14 33390_at 

15 767_at 

16 32080_at 

17 39358_at 

18 37747_at 

19 245_at 

20 3373 1_at 

> 

OTHER - 

accuracy 
confidence 

1 34251 _at 

2 3701 8_at 

BNSDOCID: <WO 03039443A2_I_> 



PCT/EP02/12303 

1.78653441127702 0 

1.77233756136773 0 

1 .76038682206377 0 

1 .74785276488982 0 

1.74122046729845 0 

1.73740374189728 0 

1.72097134093781 0 

-1.70225469167763 0 

1 .695944330956 0 

i 

1.67498682162383 0 

1.65171897393137 0 

1.62582771838167 0 

samples: 10/45 
0.981818181818182 
0.981818181818182 

1.12590878042921 0 

1.10268143578403 0 

682 



WO 03/039443 

3 33920_at 

4 35941 _f_at 

5 256_s_at 

6 37333_at 

7 32434_at 

8 1959_at 

9 37147_at 

10 33284_at 

11 40864_at 

12 35154_at 

13 2047_s_at 

14 41763_g_at 

15 36900_at 

16 3335 1_at 

17 36936_at 

18 37263_at 

19 38695_at 

20 40509 at 



PCT/EP02/12303 

-0.956076703831482 0 
-0.90383440767488 0 
-0.858111816204111 0.01 
0.8275447098375 0 
0.805850409739795 0 
0.796925434525945 0 
-0.773953141034502 0 
-0.771438360960095 0 
-0.770641950715737 0 
-0.764874807980337 0 
-0.748787188622726 0 
0.748545954599463 0 

i 

-0.74265889729539 0 

0.742532920653334 0 

0.733922207115175 0 

0.729044492680672 0 

-0.71970619250199 0 

0.712799214281053 0 
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WO 03/039443 

OTHER-T1517 

accuracy 
confidence 

1 39649_at 

2 40718_at 

3 39775_at 

4 34789_at 

5 32543_at 

6 34110_g_at 

7 38487_at 

8 40493_at 

9 40698_at 

10 41273_at 

11 33284_at 

12 32434_at 

13 39755_at 

14 4081 7_at 

03039443A2_I_> 



PCT/EP02/12303 

samples: 10/16 

1 

1 

3.30712305398492 0 

-2.52673625599382 0 

-1.97869028082043 0 . 

1.95413968158198 0 

-1.91464724323262 0 

-1.77793772892734 0 

-1.76846306892822 0 

-1 .6739728895294 0 

1.65346991374979 0 

i 

-1.61494643368443 0 

-1.59460520214407 0 

1 .57854380538056 0 

-1.57527382190822 0 

-1.54437681308404 0 

684 



WO 03/039443 



PCT/EP02/12303 



15 37408_at 

16 33102_at 

17 1752_at 

18 37954_at 

19 38791 _at 

20 34210 at 



-1.53683716438534 0 



1.53530070976794 0 



-1.52886252404363 0 



-1.5109989255419 0 



-1.48386160940786 0 



1 .44938674947878 0 



OTHER - T821 

accuracy 
confidence 

1 32323_at 

2 3781 1_at 

3 34512_at 

4 37809_at 

5 40585_at 

6 33284_at 

7 38096_f_at 

8 36973 at 



samples: 10/9 



0.994837795579117 



-1.94417836607133 0 



-1.61775423684388 0 



-1.39224768490919 0 



1.31872220085798 0 



1.31087716877391 0 



-1.29020750912798 0 



-1.25756828218199 0 



1.23300907353238 0 
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BNSDOCID: <WO 03039443A2J_> 



WO 03/039443 



PCT/EP02/12303 



9 35940_at 

10 38808_at 

11 37333_at 

12 2047_s_at 



-1 .20559489876876 0 

-1.18153953430958 0 

1.17979557242332 0 

-1.17934561734238 0 



1 3 AFFX-HUMRGE/M1 0098_M_at -1 . 1 5756874447998 0 



14 958_s_at 

15 34251_at 

16 38963_i_at 

17 35638_at 

18 38095_i_at 

19 37657_at 

20 4071 8_at 



1.1376378956 



1.12590878042921 0 



-1.12200244261352 0 



-1.12014975713644 0 



-1.10572791785236 0 



-1.08109523793911 0 



0.918288266614732 0 



1 

2 



T1517 - 

accuracy 
confidence 
39649_at 
40718 at 



samples: 16/39 
1 

0.99537936517205 
-3.29831494694965 0 
2.08511115510612 0 
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BNSDOCID: <WO 03039443A2 I > 



WO 03/039443 

3 38487_at 

4 34110_g_at 

5 34789_at 

6 38435_at 

7 32543_at 

8 41273_at 

9 40493_at 

10 38096_f_at 

11 41471_at 

12 37954_at 

13 33454_at 

14 38791_at 

15 41096_at 

16 1752_at 

17 38833_at 

18 210_at 

19 3501 6_at 

20 37669_s_at 

BNSDOCID: <WO 03039443A2J_> 



PCT/EP02/12303 

2.00096141225403 0 
1 .76946763277471 0 
-1.70643590139573 0 
-1.66171359352607 0 
1.59263172662039 0 
1.54397178886438 0 
1.48615605789895 0 
-1.44819099507958 0 
-1.41828217671556 0 

1.41406051783378 0 

1 .39832642238269 0 

1.38083401177366 0 

-1.36447605743032 0 

1.36363148385769 0 

-1.34891815086687 0 

-1.34320613976446 0 

-1.33685666840696 0 

0.806171390298286 0 
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WO 03/039443 

T1517 -T821 

accuracy 
confidence 

1 40718_at 

2 39649_at 

3 40698_at 

4 38096_f_at 

5 39775_at 

6 38487_at 

7 33121_g_at 

8 3501 6_at 

9 38095_i_at 

10 32506_at 

11 38833_at 

12 34110_g_at 

13 41096_at 

14 38391_at 

BNSDOCID: <WO 03039443A2_I_> 



PCT/EP02/12303 

i 

samples: 16/9 

1 

1 

3.23930438679201 0 
-3.19787649222746 0 
-3.08468795621776 0 
-2.65789275034265 0 
2.36672318019601 0 
2.29609697942891 0 
-2.21859945215003 0 
-1.96619441751917 0 
-1.91141006266887 0 

i 

-1.88419268611225 0 

-1.85079137548356 0 

1 .85075648886345 0 

-1.8279606340398 0 

1.81966548635468 0 

688 



WO 03/039443 

15 35766_at 

16 34789_at 

17 41609_at 

18 41273_at 

19 37344_at 

20 35340_at 

T821 - 

accuracy 
confidence 

1 3781 1_at 

2 3839 1_at 

3 35638_at 

4 32323_at 

5 35940_at 

6 36973_at 

7 35264_at 

8 361_at 

BNSDOCID: <WO 03039443A2J_> 



PCT/EPO2/12303 

1.75448820173577 0 
-1.71046499814806 0 
-1.710217028597 0 
1.70663861548637 0 
-1.69281730857466 0 
1.69006593600413 0 

samples: 9 / 46 
1 

0.920027437519393 

1 .59605072597366 0 

-1.3878317468135 0 

1.38266687057184 0 

1.35561326937612 0 

1.20094762805468 0 

-1.17706120019907 0 

-1.10943417036523 0 

1.07264489085601 0 
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WO 03/039443 



PCT/EP02/12303 



9 36802_at 

10 3451 2_at 

11 35939_s_at 

12 3906 1_at 

13 37326_at 

14 32747_at 

15 1096_g_at 

16 33121_g_at 

17 41535_at 

18 37023_at 

19 38780_at 



-1.02457173409803 0 
1.0204248889286 0 
0.955125481450532 0 
-0.947115729172708 0 
-0.935233446940231 0 
-0.930843404980031 0 
0.927398573506987 0 
0.923255276498333 0 
-0.916652918883009 0 

i 

-0.915934376270622 0 
-0.915816135921152 0 



1 
2 



3. Analyse: analysis of ALL subtypes 



ALL814 - 

accuracy 
confidence 
1402_at 
37747 at 



samples: 3/14 



2.08175003568258 0.01 



2.03525291878921 0 



690 



BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 

3 37701_at 

4 2042_s_at 

5 35260_at 

6 1476_s_at 

7 3261 6_at 

8 932_i_at 

9 36139_at 

10 39730_at 

11 2024_s_at 

12 38730_at 

13 3702 1_at 

14 933_f_at 

15 41396_at 

16 33856_at 

17 33905_at 

ALL814 - ALLMLL 

accuracy 



PCT/EP02/12303 

1 .72252048284758 0.01 
-1.63373519277864 0 
-1.63074548017429 0 
-1.58695680100552 0 
1.57366205688451 0 
-1.57233356229385 0 
-1.5665197341194 0 

-1.5464188647031 0.01 

1.48918618192137 0.01 

i 

-1.45475301578891 0.01 

1.45440716568168 0 

-1.42175300666003 0 

-1.38553827410353 0 

1.37781449850137 0 

1.3359321134002 0,01 

samples: 3/4 
1 
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BNSDOCID: <WO 03039443A2J_> 



WO 03/039443 

confidence 

1 2042_s_at 

2 36638_at 

« 

3 1474_s_at 

4 706_at 

5 38994_at 

6 34785_at 

7 36798_g_at 

8 41191_at 

9 39827_at 

10 585_at 

11 2069_s_at 

12 529_at 

13 307_at 

14 32842_at 

15 36873_at 

16 41747_s_at 



PCT/EP02/12303 

1 

-8.02342094912714 0.05 
-4.58127332202829 0.02 
-4.09073883260025 0.03 
-3.85465872728703 0.02 
-3.74888987537358 0.02 
-3.3713121522884 0.03 
-3.35179120581516 0 
-3.25108066126486 0.01 

-3.14944671750591 0.02 

-3.09495658747361 0.04 

-3.01980201927276 0.02 

2.93352551602605 0.01 

2.92372091093276 0.03 

-2.85788132874268 0.01 

-2.84199424616029 0.05 

-2.76420890085279 0.01 



BNSDOCID: <WO 03039 4A3A2J_> 



WO 03/039443 

ALL814 - ALLPH 

accuracy 
confidence 

1 35260_at 

2 41177_at 

3 36638_at 

4 38767_at 

5 37747_at 

6 39327_at 

7 1476_s_at 

8 1636_g_at 

9 37159_at 

10 932_i_at 

11 38994_at 

12 1402_at 

13 3231 9_at 

14 2047_s_at 

15 40936 at 



PCT/EP02/12303 

samples: 3/7 

1 

1 

-2.82900147590099 0.01 
2.74938757132087 0 
-2.41641563333395 0.01 
-2.40416010568405 0.01 
2.39180707441687 0 
-2.27856651600595 0 
-2.09613072968554 0.02 
-2.00644064085741 0.01 
-1.98769010758559 0 
-1.95869341917388 0 
-1.94783634512686 0.02 

1.91294884399227 0 

-1.89265256004687 0.01 

-1.89118154910995 0 

-1.84560154590494 0 

693 



03039443A2J_> 



WO 03/039443 

16 39730_at 

17 39089_at 

18 41734_at 

19 39755_at 

ALL814 - ALLT 

accuracy 
confidence 

1 36103_at 

2 35350_at 

3 41654_at 

ALLMLL - 

accuracy 
confidence 

1 36873_at 

2 36638_at 

3 33358 at 



PCT/EP02/12303 

-1.84157870686046 0.03 
-1.83344549691856 0 
-1.82721322918029 0.01 
-1.80690655801942 0.03 

samples: 3/3 

1 

1 

3.43807291883285 0.01 

3.3144051465682 0.04 

-3.10101195313041 0.04 

samples: 4/13 

1 

1 

2.83046978285357 0 
2.81082924166381 0 
2.41284542345819 0 
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WO 03/039443 



PCT/EP02/ 12303 



4 34785 at 



5 36667 at 



6 
7 
8 
9 



41503_at 
36690_at 
706_at 
32842 at 



10 41747_s_at 

11 32145_at 

12 36798_g_at 

13 41191_at 

14 40763_at 

15 585_at 

16 41470_at 

17 40786_at 

18 38037_at 

19 34583_at 

20 1140 at 



2.01800062696643 0 

2.00264026114972 0 

1.98278140811591 0 

1 .97928285044649 0.01 

1.96737076450007 0 

1 .88883935305863 0 

1.87619134543011 0 

1.87136682311337 0 

1.76515008934463 0.01 

1.7369773616741 0 

1.69441997605083 0 

1.66246850330104 0 

1.53671027688439 0 

1 .52293758990032 0 

1.50762328192198 0.01 

1 .46227478562542 0 

1.46133065433595 0 
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BNSDOCID: <WO 03O39443A2J > 



WO 03/039443 

ALLMLL - ALLPH 

accuracy 
confidence 

1 40723_at 

2 706_at 

3 41191_at 

4 36638_at 

5 36873_at 

6 36690_at 

7 33358_at 

8 36798_g_at 

9 36667_at 

10 41177_at 

11 40865_at 

12 37967_at 

13 34892_at 

14 40396_at 

15 1140_at 



PCT/EP02/12303 

samples: 4/7 

1 

1 

3.55841873411154 0 

3.32668949661753 0 

2.96542484991746 0 

2.84067512400178 0 

2.8258041781711 0 

2.54358082956883 0 

2.34079500414888 0 

2.21311555975342 0.01 

2.20932580996057 0.01 

2.1413450003396 0 

i 

2.11221046250059 0 

-2.08412772392346 0 

-2.08369333371306 0 

1 .96552338894503 0 

1.96261636340905 0 

696 



BNSDOCID: <WO 03039443A2J_> 



WO 03/039443 

16 33134_at 

17 32842_at 

18 41503_at 

19 32145_at 

20 40763_at 

ALLMLL - ALLT 

accuracy 
confidence 

1 2069_s_at 

2 41153_f_at 

3 41156_g_at 

4 33352_at 

5 36638_at 

6 1185_at 

7 37775_at 

8 1105_s_at 

9 41155 at 



PCT/EP02/12303 

-1.95145751139773 0 

1.84503319654406 0 

1.8284794750916 0 

1.82248695271075 0 

1.78804175908387 0.01 

samples: 4/3 

1 

1 

13.292701923441 0 

10.2582391724747 0 

5.75960819662385 0.01 

« 

4.58870845894255 0 

4.58127332202829 0.03 

4.45457229345442 0.01 

-4.11123301947466 0.02 

-4.04867441577307 0.03 

3.98831214950398 0.01 
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BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 PCT/EP02/ 1 2303 

10 38994_at 3.88924207710779 0.02 

i 

11 34785_at 3.83690018368942 0.01 

12 3281 9_at 3.567056723698 0.03 

13 3831 9_at -3.55471475398643 0.01 

14 2045_s_at 3.54943843148795 0.02 

15 40159_r_at 3.40746200289675 0.03 

16 39136_at -3.36701895470486 0.02 

17 1110_at -3.33969464270628 0.01 

18 3801 7_at 3.32515685260135 0.01 

1 i 

19 605 at -3.28310118648462 0.02 



4. Analyse: other analyses 



1 
2 
3 



ALLPH - 

accuracy 
confidence 
1389_at 
41734_at 
38336 at 



samples: 7/10 
1 



1.58617196971584 0 



1.55949651759221 0 



1.52692526781459 0 
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BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



4 33134_at 

5 36878_f_at 

6 39755_at 

7 38833_at 

8 33924_at 

9 34362_at 

10 36536_at 

11 37344_at 

12 38095_i_at 

13 35260_at 

14 41177_at 

15 38096_f_at 

16 36773_f_at 

17 39824_at 

18 31898_at 

19 1636_g_at 

20 41609 at 



1.449713769608 0 

1 .36077477960263 0 

1.27483851783738 0 

1.2244093710462 0 

1.22263315100349 0 

1.1962046547055 0 

1.19336569573264 0 

1.18918159634593 0 

1.16331309702494 0 

1.12932649576543 0 

-1.07976913600882 0 

1.05014739949744 0.01 

1.0492226005037 0 

1 .03626825828771 0 

1.02765158070601 0 

1 .0227576964995 0 
0.997013086295946 0 



699 



BNSDOCID: <WO 03039443A2_L> 



WO 03/039443 



PCT/EP02/12303 



ALLPH - ALLT 

accuracy 
confidence 

1 1105_s_at 

2 3831 9_at 

3 38096_f_at 

4 37039_at 

5 35016 at 



samples: 7/3 
1 



6 
7 
8 
9 

10 
11 
12 



38833_at 
39262_at 
32649_at 
33821 _at 
41609_at 
38147_at 
38095 i at 



13 37739 at 



14 38894_g_at 



15 36638 at 



-4.05162267253209 0 

-3.66580320533053 0.03 

2.88234070062166 0.02 

2.80755189593111 0 

2.77774834690928 0.01 

2.75560710134977 0.01 

-2.59234132448068 0 

-2.55542908459441 0 

-2.5421725262322 0.01 

2.5109183575568 0 

-2.50135496035854 0 

2.48939716688646 0.02 

-2.44732228148107 0 

2.42422840620003 0.01 

2.41641563333395 0 
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BNSDOCID: <WO 03039443A2J_> 



WO 03/039443 



PCT/EP02/12303 



16 38361 _g_at 



17 2059 s at 



2.381 1 6082851 993 0.01 



-2.37547551809124 0.01 



18 33425_at 

19 38949 at 



-2.36441631934975 0 



-2.27455845085004 0 



20 39755_at 



2.26518913381284 0 



1 
2 
3 
4 
5 
6 
7 
8 
9 



ALLT- 

accuracy 

confidence 

3831 9_at 

33821 _at 

1105_s_at 

38147_at 

38949_at 

33425_at 

40407_at 

1110_at 

39136 at 



samples: 3/14 
1 



3.50494628126444 0 

2.86458211053638 0 

2.7919896009269 0 

2.2999876938771 0 

2.275851563485 0 

2.24691287113975 0 

2.23626457040595 0 

2.23213485898084 0 

2.21536950680885 0 



701 



BNSDOCID: <WO 03039443A2_1_> 



WO 03/039443 

10 41535_at 

11 2059_s_at 

12 39262_at 

13 34367_at 

14 3501 6_at 

15 38096_f_at 

16 37775_at 

17 3891 7_at 

18 33238_at 

19 1498_at 

20 41163_at 

ALLPHNEG - ALLPHPOS 

accuracy 
confidence 

1 38336_at 

2 33134_at 

3 39755 at 



PCT/EP02/12303 

2.20104335983474 0 

2.17544015063967 0 

2.14503507257872 0 

2.14251924457163 0 

-2.12822205034103 0 

-2.0151362322022 0 

2.0098435918241 0.01 

2.00844440766432 0 

2.00529430466423 0 

i 

1.98727437856937 0 

1.90540704553591 0 

samples: 11/7 

i 

1 

0.946908445764721 

-1 .5382379030083 0 

-1.30650437502273 0 

-1.28612797222091 0 
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BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



4 1636_g_at 

5 38833_at 

6 41177_at 

7 34168_at 

8 38096 _f_at 

9 38095_i_at 

10 33924_at 

11 39756_g_at 

12 3831 2_at 

13 36878_f_at 

14 41193 at 



15 
16 
17 
18 
19 
20 



37384_at 

32706_at 

33441_at 

41547_at 

36773_f_at 

32649 at 



-1.04852861613762 0 
-1.04767357167583 0 
1 .041 54428480732 0 
-1.00022056220148 0 
-0.991139993364388 0 
-0.969632957696579 0 
-0.965257886268051 0 
-0.964856826724863 0 
-0.964240310753493 0 
-0.961827994429321 0 
-0.957818027300299 0 
-0.934644823332001 0 
0.916645445316056 0 
-0.910220742840358 0 
0.895837845645142 0 
-0.884139970624711 0 
0.833759113455451 0 



703 
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Table 21 



Affy 
met 
rix_l 
D 


Description_microarray 


Symbo 
l 


Description_Net 
Affx 


Chrom 
osome 


100 

_9_ 
at 


Y08200 /FEATURE= 
/UtrllNl l ivjiN— nor\A\DO i r\M nurnu 
sapiens mRNA for rab geranylgeranyl 
transferase, alpha-subunit 


RABG 


Rab 

nf^ranvlnpranvltr 

y c» i ai ly iy d ai ly in 

ansferase, 
alpha subunit . 


14q11. 

2 


105 

2_S 

_at 


M83667 /FEATURE=mRNA 

vr^cciMITir^KI— Ull IMMP1I ADA Unman 

NF-IL6-beta protein mRNA, complete 
cds 


CEBP 
n 


CCAAT/enhanc 
*=*r hindinn 

protein (C/EBP), 
delta 


8p11.2 
-p11.1 


106 
2_g 
_at 


U00672 /FEATURE^ 

/PiCCIMITIOKI— 1 innf^TO Unman 
/UtrllNl 1 IV^IN— UUUD/Z nuiiiciii 

interleukin-1 0 receptor mRNA, 
complete cds 


IL10R 
A 


interleukin 10 
rvappnjtnr alnha 

i cue yJ L\J 1 , aipi ia 


11q23 


109 
_at 


M28170 /FEATURE= 
/nPPiMiTlOW=HI IMP^PP Human cell 

surface protein CD19 (CD19) gene, 
complete cds 


CD19 


CD19 antigen 


16p11. 
2 


110 
5_s 
_at 


M 12886 /FEATURE= 
/DEFINITION=HUMTCBYY Human T- 
cell receptor active beta-chain mRNA, 
complete cds 


TRB 


T cell receptor 
beta locus 


7q35 


111 
0 a 


M2 1 624 /FE ATU RE=m RN A 
/DEFINITION=HUMTCRGC Human T- 


TRD 


T cell receptor 
delta locus 


14q11. 
2 
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BNSDOCID: <WO 03039443A2_I_> 



WO (13/039443 
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t 


cell receptor delta chain mRNA (VJC- 
region), complete cds 




delta locus 


2 


111 

5_a 
t 


M25897 /FEATURE=mRNA 
/DEFINITION=HUMPF4A. Human 
platelet factor 4 (PF4) mRNA, 
complete cds 


PF4 


piateiei Tacior h- 


q21 


111 
7_a 
t 


L27943 /FEATURE=mRNA 
/DEFINITION=HUMCYDE Homo 
sapiens cytidine deaminase (CDA) 
mRNA, complete cds 


CDA 


cytidine 
deaminase 


-p35 


114 
0_a 
t 


L25851 /FEATURE= 
/DEFINITION=HUMINAE Homo 
sapiens integrin alpha E precursor, 
mRNA, complete cds 


ITGAE 


"integrin, alpha I 
E (antigen 
CD103, human 
mucosal 
lymphocyte 
antigen 1; alpha 
polypeptide)" 


17p13 


118 
5_a 
t 


D49410 /FEATURE=expanded_cds 
/DEFINITION=HUMIL3RA1 2 Human 
gene for interleukin 3 receptor alpha 
subunit, exon 12 and partial cds 








121 

7_g 
_at 


X07109 /FEATURE=cds 
/DEFINITION=HSPKCB2A Human 
mRNA for protein kinase C (PKC) type 
beta II 


PRKC 
B1 


protein Kinase 
C, beta 1 


1 1fir>11 
2 


123 

o_g 

_at 


U78556 /FEATURE= 
/DEFINITION=HSU78556 Human 
cisplatin resistance associated alpha 
protein (hCRA alpha) mRNA, complete 


CRA 


cisplatin 

resistance 

associated 


1q12- 
q21 



705 
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t 


cds 




i 




125 
2_a 
t 


M73547 /FEATURE= 
/DEFINITION=HUMPOLLA Human 

nnlx/nn^i^ lnr*n^ fP)P1 n^n^^ mPMA 

complete cds 


D5S34 
6 


DNA segment, 
single copy 

fJIUUt; LINO- 

CAI/LNS-CAII 
(deleted in 
polyposis 


5q22- 
q23 


137 
_at 


U65404 /FFATURF= 
/DEFINITION=HSU65404 Human 
erythroid-specific transcription factor 
EKLF mRNA, complete cds 


Kl F1 
r\i_i i 


factor 1 
(erythroid) 


1 Qn1 ^ 

13- 

p13.12 


138 
5 a 

w CI 

t 


M77349 /FEATURE= 
/DEFINITION=HLJMTGFRItt Human 

transforming growth factor-beta 
induced gene product (BIGH3) mRNA, 
complete cds 


TGFBI 


transforming 

yiuwin iciwiur, 

beta-induced, 

68kD 


5q31 


138 
9_a 
t 


J03779 /FEATURE=mRNA 
/DEF!NITION=HUMCALLA Human 
common acute lymphoblastic leukemia 


MME 

t 


membrane 

metallo- 

endopeptidase 

/noi if 1 
\\ IcULIdl 

endopeptidase, 
enkephalinase, 
CALLA, CD10) 


3q25.1 
-q25.2 


138 
_at 


U66464 /FEATURE= 
/DEFINITION=HSU66464 Human 
hematopoietic progenitor kinase 
(HPK1) mRNA, complete cds 


MAP4 
K1 


mitnnpn- 

activated protein 
kinase kinase 
kinase kinase 1 


I I O. 
1- 

q13.4 


140 
2_a 
t 


M16038 /FEATURE= 
/DEFINITION=HUMLYN Human lyn 
mRNA encoding a tyrosine kinase 


LYN 


v-yes-1 
Yamaguchi 
sarcoma viral 


8q13 



706 
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t 


mRNA encoding a tyrosine kinase 




related 

oncogene 

homolog 




4_s 
_at 


1 I00^7« /PPATI IRP— rHcif^ 

/DEFINITION=HSU22376 Human (c- 
myb) gene, complete primary cds, and 
five complete alternatively spliced cds 






• 


6_s 
_at 


/DEFINITION=HSU22376 Human (c- 
myb) gene, complete primary cds, and 
five complete alternatively spliced cds 








149 
5_a 
t 


M34057 /FEATURE= 
/UtrlNI I lUN-nUIvl I orbl b numan 

transforming growth factor-beta 1 
binding protein mRNA, complete cds 


LTBP1 


latent 

■fro ncfnrminn 
LI dl lolUI 1 1 III iy 

growth factor 
beta ' binding 
protein 1 


2p22- 


149 
/_a 
t 


L04270 /FEATURE= 
/Utr I INI 1 ILJIN— nUlvl 1 l\irt\r\r' norno 

sapiens (clone CD18) tumor necrosis 
factor receptor 2 related protein 
mRNA, complete cds 


LTBR 


lymphotoxin 

(TNFR 
superfamily, 
member 3) 


12p13 


8_a 
t 


i n^i/ia /PPATI IRF= 
/DEFINITION=HUMTYRKIN Human 
protein tyrosine kinase related mRNA 
sequence 








152 
9_a 
t 


U50534 /FEATURE= 
/DEFINITION=HSU50534 Human 
BRCA2 region, mRNA sequence 
CG003 









707 
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163 
_at 


U07563 /FEATURE=Poly J _A_Site#1 

/nPPIMITinMzzH^ARI f^R? Human 

proto-oncogene tyrosine-protein kinase 
(ABL) gene, exon 1a and exons 2-10, 
complete cds 








175 
2_a 
t 


AD000092 /FEATURE=cds#6 
/DEFINITION=CH1 9HHR23 Homo 
sapiens DNA from chromosome 

iypiO.Z COoiniUo rwli£*+U, i\OU^/Z dMU 

R28549 containing the EKLF, GCDH, 
CRTC, and RAD23A genes, genomic 
sequence 








188 
4_s 
_at 


M15796 /FEATURE= 
/DEFINITION=HUMCYL Human cyclin 
protein gene, complete cds 


PCNA 


proliferating cell 
nuclear antigen 


20pter- 
p12 


191 

1 — S 


M60974 /FEATURE= 

/PiCTCTIKIITir^M— Wl llV/ini A nn^l^ Human 

growth arrest and DNA-damage- 
inducible protein (gadd45) mRNA, 
complete cds 


GADD 


growth arrest 

CllIU L-/ IN/A 

damage- 
inducible, alpha 


1p31.2 


9_a 
t 


riftRR7>4 /PPATI IRP— 
UOOD/4 /rCnl Ui\C- 

/DEFINITION=D88674 Homo sapiens 
mRNA for antizyme inhibitor, complete 
cds 


OA7IM 

i 


UI 1 1 1 LI in ic 

decarboxylase 

antizyme 

inhibitor 




0_s 
at 


vcqqcc . /FFATI IRF= 

/DEFINITION=HSNM23H2G H.sapiens 
RNA for nm23-H2 aene 




nnn-rnpta^tatif* 

1 l\J\ I I 1 1 V? lOOLu LIV-# 

cells 2, protein 
(NM23B) 
expressed in 


17a21 
3 


198 
3 a 


X68452 /FEATURE=cds 
/DEFINITION=HSCYCD2 H.sapiens 


CCND 
2 


cyclin D2 


12p13 



708 



WO 03/039443 



PCT/EP02/12303 



t 


mRNA for cyclin D2 


2 






201 
9_s 
_at 


M68892 /FfcATURfc- 
/DEFINITION=HUMINTB7 Human 
integrin beta-7 subunit mRNA, 
complete cds 


\TC1VZ7 


infonrin h^tii V 
iiiicyiiii, ucia i 


12a13 
13 

• 


202 
4_s 
_at 


M79321 /FEATURE 3 
/DEFINITION=HUMLYNTK Human 
Lyn B protein mRNA, complete cds 


LYN 


v-yes-1 

Yamaguchi 

sarcoma viicii 

related 

oncogene 

homolog 


8q13 


204 
2_s 
_at 


M15024 /FEATURE= 
/DEFINITION=HUMCMYBLA Human 
c-myb mRNA, complete cds 


MYb 


v-myu avian 
myeloblastosis 
viral oncogene 
homolog 


q23 


204 
5_s 
_at 


M16592 /FEATURE=mRNA 
/DEFINITION=HUMHCKB Human 
hemopoietic cell protein-tyrosine 
kinase (HCK) gene, complete cds, 
clone HK24 


HCK 


hemopoietic cell 
Kinase 


20q11- 
n19 


204 
7_s 
_at 


M23410 /FEATURE- 
/DEFINITION=HUMPLAKO Human 
plakoglobin (PLAK) mRNA, complete 
cds 


II ID 

JUr 


junciion 
plakoglobin 


17n91 


one 

9_s 
_at 


ivnfiftfti /FEATURE=mRNA 
/DEFINITION=HUMLCKAA Human 
lymphocyte-specific protein . tyrosine 
kinase (Ick) mRNA, complete cds 


LCK 


lymphocyte- 
specific protein 
tyrosine kinase 


1p35- 
p34.3 
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206 

Q Q 

_at 


L23805 /FEATURE^ 
/nFFlNITION=HUMCATENIN Human 
alphal (E)-catenin mRNA, complete 
cds 


CTNN 
A1 


catenin 
(cadherin- ' 
associated 
protein), alpha 1 
(102kD) 


5q31 


_at 


mq^R7R /FEATURE= 
/DEFINITION=HUMPLCB2A Homo 
sapiens phospholipase C-beta-2 
mRNA, complete cds 


PLCB2 


phospholipase 
C, beta 2 


15q15 


OAR 

_at 


M9^9an /FEATURE= 
/DEFINITION=HUMLNHR Human 
lymph node homing receptor mRNA, 
complete cds 


SELL 


selectin L 
(lymphocyte 
adhesion 
molecule 1) 


1q23- 
q25 


ZOO 
_S_ 

at 


M1A1QQ /FEATURE= 
/DEFINITION=HUMLAMR Human 
laminin receptor (2H5 epitope) mRNA, 
5 end 


LAMR 
1 


laminin receptor 
i (67kD, 
ribosomal 
protein SA) 


3p21.3 


077 

£.11 

_at 


i nft?4fi /FEATURE^ 
/DEFINITION=HUMMCL1 X Human 
myeloid cell differentiation protein 
(MCL1) mRNA 








307 
_at 


J03600 /FEATURE^ 
/DEFINITION=HUMLOX5 Human 
lipoxygenase mRNA, complete cds 


ALOX5 


arachidonate 5- 
lipoxygenase 


10q11. 
2 


313 
81_ 
at 


Cluster Incl. AF076483:Homo sapiens 
peptidoglycan recognition protein 
precursor (PGRP) mRNA, complete 
cds /cds=(44,634) /gb=AF076483 
/gi=3342532 /ug=Hs. 137583 /len=690 


PGLY 
RP 


peptidoglycan 

recognition 

protein 


19q13. 
2- 

q13.3 
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317 
93_ 
at 


Cluster Incl. 
AL036554:DKFZp564J2262_r1 Homo 
sapiens clmnm, a euu 
/clone=DKF2p564J2262 /clone_end=5 
/gb=AL036554 /gi=5927801 
/ug=Hs.1379/len=517 


DEFA3 


defensin, alpha 
3, neutrophil- 
sDecific 


8pter- 
p23.3 

• 


318 
59_ 
at 


Cluster Incl. J05070:Human type IV 
collagenase mRNA, complete cds 
/cds=(19,2142) /gb=J05070 

/gi=1 7 / ^U4 /Ug— rlS. 1 O 1 # OO /ien— iioo^ 


MMP9 


matrix 

metalloproteinas 
e 9 (gelatinase 
B 92kD 
gelatinase, 
92kD type IV 
collagenase) 


20q11. 
2- 

q13.1 


318 
70_ 
at 


Cluster Incl. xi4U4o.nunrian mrviNM iur 
leukocyte antigen CD37 /cds=(63,908) 
/gb=X14046 /gi=29793 /ug=Hs. 153053 
/len=1125 




CD37 antiaen 

i 


19p13- 
q13.4 


318 
98_ 
at 


Cluster incl. uooyo/.riurnan mr\iNA\ toi 
KIAA0212 gene, complete cds 
/cds=(58,2031) /gb=D86967 
/gi=1 504007 /ug=Hs.1 54332 /len=6072 


KIAA0 
212 


KIAA0212 aene 
product 


3p25.3 


319 
36_ 
s_at 


Cluster incl. ADUU/oyu.norno sapiens 
KIAA0430 mRNA, complete cds 
/cds=(0,3172) /gb=AB007890 
/gi=2887438 /ug=Hs.166163 /len=601 1 


KIAA0 
430 


KIAA0430 aene 
product 


16p13. 
12 


320 

72 

at 


Cluster Incl. U40434:Human 
mesothelin or CAK1 antigen precursor 
mRNA, complete cds /cds=(99,1985) 
/gb=U40434 /gi=1 1 45723 
/ug=Hs.1 55981 /len=2114 


MSLN 


mesothelin 


16p13. 
3 
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320 

80 

at 


Cluster Incl. L11669:Human 
tetracvcline transporter-like protein 
mRNA, complete cds /cds=(120,1487) 
/gb=L11669 /gi=307501 
/ug=Hs. 157145 /len=1758 


TETR 
AN 


tetracycline 
transporter-like . 
protein 


4p16.3 


321 
45_ 
at 


Cluster Incl X58141:Human mRNA for 
erythrocyte adducin alpha subunit 
/cds=(1 54,2367) /gb=X58141 
/gi=28381 /ug=Hs. 1 83706 /len=3905 








66_ 
at 


Cluster Incl AB028950'Homo sapiens 
mRNA for KIAA1027 protein, partial 
cds /cds=(0,5088) /gb=AB028950 
/gi=5689390 /ug=Hs. 18420 /len=5542 


TLN1 


talin 1 


9p13 
• 


321 ! 

74_ 

at 


Cluster Incl. AF015926:Homo sapiens 
ezrin-radixin-moesin binding 
phosphoprotein-50 mRNA, complete 
cds /cds=(212 1288) /qb=AF015926 
/gi=322001 8 /ug=Hs. 184276 /len=1984 


SLC9A 
3R1 


solute carrier 
family 9 
(sodium/hydrog 
en exchanger), 
isoform 3 
regulatory factor 
1 


17q25. 
2 


321 

84 

at 


Cluster Incl. X61118:Human TTG-2 
mRNA for a cvsteine rich protein with 
LIM motif /cds=UNKNOWN 
/gb=X61118 /gi=663012 
/ug=Hs.1 84585 /len=2292 


LM02 


LIM domain only 
2 (rhombotin- 
like 1) 


11p13 


322 
29_ 
at 


Cluster Incl. AF038957:Homo sapiens 
translation initiation factor 4e mRNA, 
complete cds /cds=(58,768) 
/gb=AF038957 /gi=3329383 
/ug=Hs.19122 /len=961 


EIF4E 
L3 


eukaryotic 
translation 
initiation factor 
4E-like 3 


2q37.1 
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322 
32_ 

Sit 


Cluster Incl. AF047181:Homo sapiens 
NADH-ubiquinone oxidoreductase 

ciihiintt PI-^f^HH mRNIA rnmnlptp rrte 

/cds=(6,575) /gb=AF047181 
/gi=2909853 /ug=Hs.19236 /len=1034 

i 


NDUF 
B5 


NADH 

dehydrogenase 
^ubiauinone') 1 
beta 

subcomplex, 5 
(16kD, SGDH) 


3q27.1 

• 


59_ 
at 


for KIAA0388 gene, complete cds 
/cds=(1 00,2343) /gb=AB002386 
/gi=2224716 /ug=Hs.1 94669 /len=4606 


EZH1 

1 — £— 1 1 1 


pnhancer of 

ii mi iwwi wi 

zeste 

(Drosophila) 
homolog 1 


17q21. 
1- 

q21.3 


323 
19_ 
at 


Cluster Incl. AL022310:dJ395P12.2 
(tax-transcriptionally activated 
glycoprotein i \z>**)s\j) iiycuiu, 
OX40L)) /cds=(1 37,688) 
/gb=AL022310 /gi=3646083 
/ug=Hs.181097 /len=3470 








23_ 
at 


oiusier inui. ivioooo^. nui i idi i 

preprothyrotropin-releasing hormone 
gene /cds=(8,736) /gb=M63582 
/gi=1 90297 /ug=Hs. 182231 /len=1457 








324 
34_ 
at 


Cluster Incl. D10522:Homo sapiens 
mRNA for 80K-L protein, complete cds 

/rric-rtfiQ l^fiy^ /nh=D10S22 

/CU5 — ^OD9, IOD/ ) ' y U \-> 

/gi=219893 /ug=Hs.75607 /len=2589 


MACS 


myristoylated 
alanine-rich 
nrotein kinase C 
substrate 
(MARCKS, 80K- 
L) 


6q22.2 


325 
06_ 
at 


Cluster Incl. AB029031:Homo sapiens 
mRNA for KIAA1108 protein, partial 
cds /cds=(0,2291) /gb=AB029031 
/gi=5689552 /ug=Hs.69472 Vlen=2576 


TBC1 
D1 


TBC1 (tre- 
2/USP6, BUB2, 
cdc16) domain 
family, member 
1 


4 
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• 






1 




325 
30_ 
at 


Cluster Incl. X56468:Human mRNA for 
14.3.3 protein, a protein kinase 
regulator /cds=(1 25,862) /gb=X56468 


YWHA 
Q 


tyrosine 3- 

monooxygenase 

/tryptophan 5- 

monooxygenase 

activation 

protein, theta 

polypeptide 


22q12- 
qter 


325 

A O 

at 


Cluster Incl. M84739:Human 

complete cds /cds=(1 08,1 361) 
/gb=M84739 /gi=1 79881 /ug=Hs.75525 
/len=1937 


CALR 


calreticulin 


19p13. 
3- 

p13.2 


one 

325 
97_ 
at 


^ine+or inH Y"7finfi1*H «^nipn^ d130 

L/IUSIGi lilUl. A / DUD 1 .n.oapicno 

mRNA for 130K protein /cds=(69,3488) 
/gb=X76061 /gi=416030 /ug=Hs.79362 
/len=4835 


RBL2 


retinoblastoma- 
Iike2(p130) 


16q12. 
2 


326 
16_ 
at 


Cluster Incl. M16038:Human lyn 
mRNA encoding a tyrosine kinase 

/orie-/OQ7 /nh=M16038 
/COS - i , iooo) /y u ivi i ij»jvj<-i 

/gi=1 87268 /ug=Hs.80887 /len=2298 


LYN 

i 


v-yes-1 

Yamaguchi 

sarcoma viral 

related 

oncogene 

homolog 


8q13 


49_ 
at 


Pinctor Inrl ySQ871 "Human TCF-1 
mRNA for T cell factor 1 (splice form 
C) /cds=(79,885) /gb=X59871 
/gi=36789 /ug=Hs. 169294 /len=2910 


TCF7 


transcription 
factor 7 (T-cell 
specific, HMG- 
box) 


5q31.1 


326 
96_ 
at 


Cluster Incl. X59841:Human PBX3 
mRNA /cds=UNKNOWN /gb=X59841 
/gi=35314 /ug=Hs.171680 /len=2581 


PBX3 


pre-B-cell 
leukemia 
I transcription 


9q33- 
q34 
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at 


/gi=35314 /ug=Hs.171680 /len=2581 




factor 3 




327 
at 


Cluster Inch X89887:Homo sapiens 
mRNA for WD reDeat Drotein (HIRA) 
/cds=(220,3273) /gb=X89887 
/gi=392821 8 /ug=Hs. 1 72350 /len=401 8 


HIRA 


HIR (histone cell 
cycle regulation 
defective) 
homolog A (S. 
cerevisiae) 


22q11. 
21 

• 


327 

47 

at 


Cluster Incl. X05409:Human RNA for 
mitochondrial aldehyde 

lilt LVJOI Ivl 1 \*i 1 Ivll ■ ^ v ■ ^ w 

dehydrogenase 1 ALDH 1 (EC 1.2.1.3) 
/cds=(36,1586) /gb=X05409 /gi=28605 
/ug=Hs.1 95432 /len=1 989 


ALDH2 


aldehyde 
dehydrogenase 
2 family 
(mitochondrial) 


12q24. 
2 


75_ 
r_at 


pinct<ar inH ARnnfi74fi*Homo saoiens 
hMmTRAI b mRNA, complete cds 
/cds=(256,1212) /gb=AB006746 
/gi=3510296 /ug=Hs. 198282 /len=2077 


PLSC 
R1 


phospholipid 
scramblase 1 


3q23 


328 
at 


Cluster Incl. U66306:Human retinoid X 

n=»rentnr alnha mRNA 3 UTR Dartial 

sequence /cds=UNKNOWN 
/gb=U66306 /gi=341 1 007 
/ug=Hs.20084 /len=3772 








19_ 
at 


Cluster Incl AJ223352'Homo sapiens 
mRNA for for histone H2B, clone pjG4- 
5-14 /cds=(1 6,396) /gb=AJ223352 
/gi=3255996 /ug=Hs.2041 8 /len=793 


H2BFA 


H2B histone 
family, member 
A 


6p21.3 


328 
21_ 
at 


Cluster Incl. AI762213:wi54d04.x1 
Homo sapiens cDNA, 3 end 
/clone=IMAGE-2394055 /clone_end=3 
/gb=AI762213 /gi=51 77880 
/ug=Hs.204238 /len=677 


LCN2 


lipocalin 2 

(oncogene 

24p3) 


9q34 
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42_ 
at 


for BCL7A protein /cds=(953,1648) 
/gb=X89984 /gi=929614 
/ug=Hs.21 1563 /len=4522 


BCL7A 


B-cell 

CLL/lymphoma 
7A 


12q24. 
13 


331 
at 


Cluster Incl. D67031:Homo sapiens 

complete cds /cds=(1 83,2207) 
/gb=D67031 /gi=2696053 /ug=Hs.8110 
/len=2920 


ADD3 


adducin 3 
(aamma^ 


10q24. 
2- 

q24.3 


331 
21_ 
9_a 

t 


Cluster Incl. AF045229:Homo sapiens 

■I'-iff-tr r\f f^. r\rr\tcx'm cifi rial inn *1 0 

regulator ot vj protein siy nanny iu 
mRNA, complete cds /cds=(1 32,635) 
/gb=AF045229 /gi=2906029 
/ug=Hs.82280 /len=753 


RGS1 
n 


regulator of G- 
nrotein 
signalling 10 


10q25 


331 
34_ 
at 


(cluster inci. adu i luoo.norno sapiens 
mRNA for KIAA0511 protein, partial 
cds /cds=(0,2802) /gb=AB011083 
/gi=3043545 /ug=Hs.8402 /len=3563 


ADCY 
3 


dU^/ 1 IV Id Iv 

cyclase 3 


2p24- 
p22 


332 
38_ 
at 


Cluster Incl. U23852:Human T- 
lymphocyte specific protein tyrosine 
Kinass pooicK vCR/ auDciani rnrviN/A, 
complete cds /cds=(59,1 150) 
/gb=U23852 /gi=775207 /ug=Hs.1765 
/len=2129 








332 

66 

at 


Cluster Incl. AF015254:Homo sapiens 
serine/threonine kinase (STK-1) 
mRNA, complete cds /cds=(58,1 101) 
/gb=AF015254 /gi=4090840 
/ug=Hs. 180655 /len=1234 


STK12 


serine/threonine 
kinase 12 


17p13. 
1 
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332 

/ o 

f_at 


Cluster Incl. X57809:Human 

rsarr3riQ6Q iriMiiuiiuyitjijuiiii icuiiuiici 

light chain mRNA /cds=(1 14,815) 
/gb=X57809 /gi=33714 /ug=Hs.181 125 
/len=915 


IGL 


immunoglobulin 
lambda locus 


22q11. 
1- 

q11.2 


332 
74_ 
T_at 


Cluster Incl. M18645:Human Ig 
rearranged lambda-chain mRNA VJC- 
rsgion suugruup iciriiuuci-iv nuni 
heterohybridoma H6-3C4 
/cds=(30,731) /gb=M18645 /gi=186103 
/ug=Hs.181125/len=872 


IGL 


immunoglobulin 
lambda locus 


22q11.' 
1- 

a11 2 


84_ 
at 


Uiuster inci. ivi i you/ .nuriicii i 
myeloperoxidase mRNA, complete cds 
/cds=UNKNOWN /gb=M19507 
/gi=188657 /ug=Hs.1817 /len=3215 


MPD 
Ivlr VJ 


1 1 lyenjpd UAiuao 

e 


17a23 

1 


19_ 
at 


cluster inci. /\t^uyOf ^.norno sapiens 
axin (AXIN) mRNA, partial cds 
/cds=(0,2703) /gb=AF009674 
/gi=2252819 /ug=Hs. 184434 /len=3385 


AXIN1 


ft vi n 


16d13 
3 


51_ 
at 


L/iUSter inci. nruo^DU/ .nornu odpiciio 
GC20 protein mRNA, complete cds 
/cds=(70,41 1 ) /gb=AF064607 
/gi=31 52667 /ug=Hs.21756 /len=812 




translation 

11 CM lOlClUvsl 1 

factor suil 
homolog 


3d21 3 
3 


OOO 

52_ 
at 


1 1 ir+Q r 1 nr*\ Y^7Qft^' W canionc npnpc 

oiusier inci. ao# yoo.n.oci|jit?iio yenco 
for histones H2B.1 and H2A 
/cds=(42,422) /gb=X57985 /gi=510989 
/ug=Hs.2178 /len=2223 








333 
58_ 
at 


Cluster Incl. W29087:56b8 Homo 
sapiens cDNA /gb=W29087 
/gi=1 309053 /ug=Hs.21894 /len=877 


KIAA1 
157 


KIAA1 1 57 
protein 


12q13. 
3- 

q14.1 



717 
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at 


/gi=1 309053 /ug=Hs.21894 /len=877 




i 


q14.1 


333 
71_ 
s_at 


Cluster Incl. U59877:Human low-Mr 
GTP-uinaing proiein ^rvADoi; inrMm, 
complete cds /cds=(60,644) 
/gb=U59877 /gi=1388194 
/ug=Hs.223025 /len=907 


RAB31 


RAB31, 

member RAS 
oncogene family 


18p11. 
3 


333 
86_ 
at 


Cluster Incl. Z97630:Human DNA 
sequence from clone 466N1 on 
chromosome 22q12-13 Contains 
H1F0(H1 histone family, member 0) 
gens, z-airnno-o-KeiODuiyraie "vun 
ligase( nuclear gene encoding 
mitochondrial protein), GALR3 (galanin 
receptor) gene, ESTs, GSSs an 








333 
90_ 
at 


Cluster Incl. AA203487:zx53d03.r1 
Homo sapiens cui\i/\, o cmu 
/clone=IMAGE-446213 /clone_end=5 
/gb=AA203487 /gi=1 799460 
/ug=Hs.226237 /len=863 




ESTs 




334 
12_ 
at 


Cluster incl. Aioooy^o.vicpiu.^.i-'u/ .i 
Homo sapiens cDNA, 5 end 
/clone_end=5 /gb=AI535946 
/gi=4450081 /ug=Hs.227751 /len=647 


LGALS 
1 


lectin, 

galactoside- 
binding, soluble, 
1 (galectin 1) 


22q13. 
1 


334 
14_ 
at 


Cluster Incl. X57398:Human mRNA for 
pM5 protein /cds=(0,3572) /gb=X57398 
/gi=35526 /ug=Hs.227823 /len=4086 


PM5 


pM5 protein 


16p13. 
11 


334 
25_ 
at 


Cluster Incl. X97548:H.sapiens mRNA 
for TIFIbeta zinc finger protein 
/cds=(361,2868) /gb=X97548 
/gi=1524108 /ug=Hs.228059 /len=3035 


TRIM2 
8 


tripartite motif- 
containing 28 


5 
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/gi=1524108 /ug=Hs.228059 /len=3035 








334 
at 


"Cluster Incl. L41 143:Homo sapiens 

nvnrQccorl ncpiirln TfTTA mRNA at 

t(1;3) translocation site, complete cds 
/cds=(221 ,532) /gb=L41 1 43 
/gi=736684 /ug=Hs.2320697len=21 46" 






• 


54_ 
at 


pinctor inH AF01 fiQ03*Homo saoiens 
agrin precursor mRNA, partial cds 
/cds=(0,6080) /gb=AF016903 
/gi=2988421 /ug=Hs.2341 37 /len=7032 


AGRN 


agrin 


1p36.3 
-p32 


335 
30_ 
at 


Cluster Incl. M33326:Human 

nonSpGClTIC crOSS-reciwAiiiy diiuydi 

(NCA) mRNA, complete cds 
/cds=(86, 1 1 35) /gb=M33326 
/gi=189101 /ug=Hs.41 /len=2287 


CEAC 
AM8 


carcinoembryoni 
c antigen- 
related cell 
adhesion 
molecule 8 


19q13. 
2 


336 
89_ 
s_at 


Cluster Incl. AF012434:untitled 
/cds=(38,394) /gb=AF012434 
/gi=2352914 /ug=Hs. 186570 /len=573 








337 
31_ 
at 


Cluster Incl. AJ130718:Homo sapiens 
mRNA for glycoprotein-associated 
amino acia uanofjuiiei y^uj^i i 
/cds=(293,1828) /gb=AJ130718 
/gi=3970724 /ug=Hs.1 94693 Vlen=2214 


SLC7A 
7 


solute carrier 
family 7 
(cationic amino 
acid transporter, 
y+ system), 
member 7 


14q11. 
2 


337 
52_ 
at 


Cluster Incl. AB020657:Homo sapiens 
mRNA for KIAA0850 protein, complete 
cds /cds=(630,2558) /gb=AB020657 
/gi=4240188 /ug=Hs. 197298 /len=3682 


NS1- 
BP 


NS1-binding 
protein 


1q25.1 
-q31.1 



719 



WO 03/039443 



PCT/EP02/12303 



338 
12_ 
ax 


"Cluster Incl. AL04941 5:Homo sapiens 
mRNA; cDNA DKFZp586N2119 (from 
Hnnp DKFZd586N21 1 9) 
/cds=UNKNOWN /gb=AL049415 
/gi=4500196 /ug=Hs.204290 
/len=1232" 








338 
21_ 
at 


Cluster Incl. AL034374: Human DNA 
sequence from clone 483K16 on 
chromosome 6p1 2. 1-21.1. Contains 
(parts of) two novel genes, 40S 

RiHncnmal nrnte>in S16 and 60S 

Ribosomal protein L31 pseudogenes, 
ESTs, STSs, GSSs and a putative 
CpG island /cds=(0,703) /gb=AL 








56_ 
at 


oinetor incl Y1 ^74 - Homo saoiens 
mRNA for putatively prenylated protein 
/cds=(343,972) /gb=Y13374 
/gi=2370152 /ug=Hs.239533 /len=1 186 


CXX1 


CAAX box 1 


Xq26 


OjO 

60_ 
at 


r^iiKst^r inri AB007931 Homo sapiens 
mRNA for KIAA0462 protein, partial 
cds /cds=(0,6831) /gb=AB007931 
/gi=3413885 /ug=Hs.239686 /len=7150 


KIAAO 
462 


KIAA0462 
protein 


1p36.1 
3 


ooo 
66_ 
at 


pin«stp«r inrl X05276 - Human mRNA for 
fibroblast tropomyosin TM30 (pi) 
/cds=(50,796) /gb=X05276 /gi=37201 
/ug=Hs.239804 /len=2049 


TPM4 


tropomyosin 4 


19p13. 
1 


339 
05_ 
at 


Cluster Incl. AF072242:Homo sapiens 
methyl-CpG binding protein MBD2 
(MBD2) mRNA, complete cds 
/cds=(229,1464) /gb=AF072242 


MBD2 


methyl-CpG 
binding domain 
protein 2 


18q21 
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/gi=3800792 /ug=Hs.25674 /len=1948 








339 
at 


Cluster Incl. AF051782:Homo sapiens 

complete cds /cds=(0,3746) 
/gb=AF051782 /gi=2947237 
/ug=Hs.26584 /len=5635 


DIAPH 

1 


diaphanous 
(Drosophila, 
homolog) 1 


5q31 

• 


24_ 
at 


L/IUSlci I riOI. r\DU,tJJu iH.numu oajjicu«a 

mRNA for KIAA1091 protein, partial 
cds /cds=(0,4080) /gb=AB029014 
/gi=5689518 /ug=Hs.26797 /len=4248 


KIAA1 
091 


KIAA1091 
protein 


11 


339 

A A 

44_ 

at 


Cluster Incl. S60099:APPH=amyloid 
precursor proieiri^ iiuiiiuiuy [nuiuan, 
placenta, mRNA, 3727 nt] 
/cds=(72,2363) /gb=S60099 
/gi=300168 /ug=Hs.64797 /len=3727 


APLP2 


amyloid beta 
(A4} Drecursor- 
like protein 2 


11q24 


oon 

63_ 
at 


OIUSTGr incl. ivi*7wo^w- nui i icai i 

azurocidin gene, complete cds 
/cds=(1 6,771) /gb=M96326 /gi=1 79301 
/ug=Hs.72885 /len=913 








10_ 
9_a 

t 


oine+or inH AFH1 fV Hnmo saoiens 

V^/IUolwi 1 1 IwL /aI \J 1 KJ%J 1 \J. 1 IV-/ 1 1 oapiw* 1 

p53 induced protein mRNA, partial cds 
/cds=(0,761) /gb=AF010310 
/gi=241 5296 /ug=Hs.21 1 605 /len=888 


PIG6 


proline oxidase 
homolog 




68_ 
at 


r^inctor inrl M1 1 777'Human terminal 
transferase mRNA, complete cds 
/cds=(328,1854) /gb=M11722 
/gi=339436 /ug=Hs.234772 /len=2068 


DNTT 


deoxynucleotidy 

transferase, 

terminal 


10q23- 
q24 


342 
10 


Cluster Incl. N90866:zb11b10.s1 
Homo sapiens cDNA, 3 end 


CDW5 
2 


CDW52 antigen 
(CAMPATH-1 


1p36 
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at 

♦ 


/clone=IMAGE-301723 /clo.ne_end=3 
/gb=N90866 /gi=1444193 
/ug=Hs.214742 /len=577 


2 


antigen) 




342 
at 


Cluster Incl. M92299:Human 
homeobox 2 1 Drotein (HOX2A) 
mRNA, complete cds /cds=(275,1084) 
/gb=M92299 /gi=1 84292 /ug=Hs.22554 
/len=2037 


HOXB 
5 


homeo box B5 


17q21- 
q22 


^4^ 

06_ 
at 


Cluster Incl AB007888*Homo sapiens 
KIAA0428 mRNA, complete cds 
/cds=(1 41 4,2526) /gb=AB007888 
/gi=2887430 /ug=Hs.28578 /len=5940 


MBNL 


muscleblind 
(Drosophila)-like 


3q25 


343 
62_ 
at 


Cluster Incl. M55531:Human glucose 
transport-like 5 (GLUT5) mRNA, 
rnmnlete cds ycds=(75 1 580) 
/gb=M55531 /gi=1 83297 /ug=Hs.33084 
/ien=2218 


SLC2A 
5 


solute carrier 
family 2 
(facilitated 
glucose/fructose 
transporter), 
member 5 


1p36.2 


343 
at 


Cluster Incl. AF006043:Homo sapiens 
^-nhosnhoalvcerate dehvdroaenase 
mRNA, complete cds /cds=(692,2293) 
/gb=AF006043 /gi=2674061 
/ug=Hs.3343 /len=2467 


PHGD 
H 


phosphoglycerat 
e 

dehydrogenase 


1p12 


345 

12 

at 


Cluster Incl. J03853: Human kidney 
alDha-2-adreneraic receDtor mRNA, 
complete cds /cds=(38,1423) 
/gb=J03853 /gi=178193 
/ug=Hs. 123022 /len=1491 


ADRA 
2C 


adrenergic, 

alpha-2C-, 

receptor 


4p16 


345 
46 


Cluster Incl. AI250799:qi36g07.x1 
Homo sapiens cDNA, 3 end 


DEFA4 


defensin, alpha 
4, corticostatin 


8p23 
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at 


/clone=IMAGE-1 858620 /clone_end=3 
/gb=AI250799 /gi=3847328 
/ug=Hs.2582 /len=542 




4, corticostatin 




345 
at 


Cluster Incl. U02687:Human growth 
tactor receptor tyrosine Kinase i rv- \) 
mRNA, complete cds /cds=(57,3038) 
/gb=U02687 /gi=409572 /ug=Hs.385 
/len=3475 | 


FLT3 


fms-related 
tvrn^ine kinase 

L y 1 UOII IV WW 1 Hww 

3 


13q12 

• 


346 
54_ 
at 


Uluster inci. Aj^^^y/ y.norno sapiens* 
mRNA for MTMR1 protein 
/cds=(0,1990) /gb=AJ224979 
/gi=4128155 /ug=Hs.23200 /len=2582 


MTMR 

1 


mvntuhiilarin 

1 1 iy ULuuuiai ii i 

related protein 1 


Xq28 


346 
63_ 

at 


Cluster Incl. M28696:Human low- 
affinity igCa rc receptor (peia-rc- 
gamma-RII) mRNA, complete cds 
/cds=(41 ,916) 7gb=M28696 /gi=1 84843 
/ug=Hs.233450 /len=1416 


FCGR 


Fc fragment of 

Inf^ I ox a/ affin itw 
lyvj, luvv diiiiiiLy 

lib, receptor for 
(CD32) 


1q23 


346 
70_ 
at 


Cluster inci. uouoyy.nurnan lyso&orndi 
alpha-mannosidase (manB) gene 
/cds=(309,3341) /gb=U60899 
/gi=2209014 /ug=Hs.234070 /len=3443 








346 

TO 

79_ 
at 


Cluster Incl. X02596:Human mRNA for 
Dcr (DreaKpomt ciusier region; gene in 
Philadelphia chromosome 
/cds=(488,4303) /gb=X02596 
/gi=29420 /ug=Hs.234799 /len=4739 


BCR 


breakpoint 
plu^fpr reaion 


22q11. 
23 


347 
80_ 
at 


Cluster Incl. AB002313:Human mRNA 
for KIAA0315 gene, partial cds 
/cds=(0,5526) /gb=AB002313 
/gi=2280475 /ug=Hs.3989 /len=6252 


PLXN 
B2 


plexin B2 


22q13. 
33 



723 
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t 


/gi=2280475 /ug=Hs.3989 /len=6252 








85_ 
at 


Pliieter Inrl AR0?8Q48Homo SSDiens 

mRNA for KIAA1025 protein, partial 
cds /cds=(0,3441) /gb=AB028948 
/gi=5689386 /ug=Hs.4084 /len=6131 


KIAA1 
025 


KIAA1025 
protein 


12q24. 
22 


347 
89_ 


Cluster Incl. S69272:cytoplasmic 
antiproteinase=38 kda intracellular 
corin<=> nrntt»ina^A inhibitor rhuman 

placenta, mRNA, 1465 nt] 
/cds=(188,1318) /gb=S69272 
/gi=546087 /ug=Hs.41072 /len=1465 


SERPI 
NB6 


serine (or 
cysteine) 
proteinase 
inhibitor, clade 
B (ovalbumin), 
member 6 


6p25 


30_ 
at 


Plnctor Inrl W9^Q86 - 1 7e7 Homo 
sapiens cDNA /gb=W25986 
/gi=1 306253 /ug=Hs.4750 /len=769 


DKFZ 

P564K 

0822 


hypothetical 
protein 

DKFZp564K082 
2 


6 


348 
71_ 
at 


Cluster Incl. W30677:zb75h10.r1 
Homo sapiens cDNA, 5 end 
/clone=IMAGE-309475 /clone_end=5 

/nh— \A/^nfi77 /ai=1 31 1 730 
/ug=Hs.5019/len=614 




Homo sapiens 
cDNA FLJ11714 
fis, clone 
HEMBA1 00521 
9, weakly similar 
to NUCLEAR 
PROTEIN SNF7 




92_ 
at 


Philter Incl AF016266'Homo saDiens 
TRAIL receptor 2 mRNA, complete cds 
/cds=(1 17,1439) /gb=AF016266 
/gi=2529562 /ug=Hs.51233 /len=3972 


TNFR 
SF10B 


tumor necrosis 
factor receptor 
superfamily, 
member 10b 


8p22- 
p21 


350 
16_ 
at 


Cluster Incl. M13560:Human la- 
associated invariant gamma-chain 
gene /cds=(795,1493) /gb=M 13560 
/gi=184518 /ug=Hs.84298 /len=2080 









724 
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/gi=184518 /ug=Hs.84298 /len=2080 








350 
78_ 

of 


Cluster Incl. X93093:H.sapiens LW 
gene /cds=(9,824) /gb=X93093 
ln\— •\ACkMCY7 /nn=H«? 108287 /Ien=1243 

t 


ICAM4 


intercellular 
adhesion 
molecule 4. 
Landsteiner- 
Wiener blood 
group 


19p13. 
2-cen 

• 


351 

OH 

at 


Cluster Incl. W68046:zd42a12.s1 

/clone=IMAGE-343294 /clone_end=3 
/gb=W68046 /gi=1 376935 
/ug=Hs.25817/len=575 


BTBD2 


BTB (POZ) 
domain 
containing 2 


19p13. 
3 


352 
04_ 
at 


Cluster Incl. U52840:Homo sapiens 
semaphorin F homolog mRNA, 
complete cds /cds=(637,3861) 
/gb=U52840 /gi=2772583 
/ug=Hs.27621 /len=8056 


SEMA 
5A 


sema domain, 
seven 

thrombospondin 
repeats (type 1 
and type 1 -like), 
transmembrane 
domain (TM) 
and short 
cvtoolasmic 

wy iw iuwi » 1 1 w 

domain, 

(semaphorin) 

5A 


5p15.2 


30_ 
at 


Pliio+or InH AFnvn'^fVHnmn ^anipns 

clone 24751 unknown mRNA 
/cds=(0,1252) /gb=AF070530 
/gi=3387885 /ug=Hs.29344 /len=1560 


CL247 
51 


hvoothetical 
protein, clone 
24751 


19p13. 
3 


352 
60_ 
at 


Cluster Incl. AB020674:Homo sapiens 
mRNA for KIAA0867 protein, complete 
cds /cds=(1 52,1 732) /gb=AB020674 


MOND 
OA 


KIAA0867 
protein 


12q21. 
31 
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at 


/gi=4240222 /ug=Hs.52081 /len=4339 j 




i 




i 

352 
64_ 
at 


Cluster Incl. AF067139:Homo sapiens 
NADH-ubiquinone oxidoreductase 
NDUFS3 subunit mRNA, nuclear gene 
pncodina mitochondrial protein, 
complete cds /cds=(1 2,806) 
/gb=AF067139 /gi=3337440 
/ug=Hs.5273 /len=887 


NDUF 
S3 


NADH 

dehydrogenase 
(ubiquinone) Fe- 
S protein 3 
(30kD) (NADH- 
coenzyme Q 
reductase) 


11p11. 
11 


352 
r_at 


Cluster Incl. M33680:Human 26-kDa 
r*»ll surface Drotein TAPA-1 mRNA, 
complete cds /cds=(238,948) 
/gb=M33680 /gi=338677 /ug=Hs.54457 
/len=1480 


CD81 


CD81 antigen 
(target of 
antiproliferative 
antibody 1) 


11p15 


353 
at 


Cluster Incl. AF091084:Homo sapiens 
rinnp fi^8 unknown mRNA. comDlete 
sequence /cds=(30,413) 
/gb=AF091084 /gi=3860005 
/ug=Hs.5825 /len=1065 


LOC51 
706 


cytochrome b5 
reductase 1 
(B5R.1) 


1p36.1 
3-q41 


353 
40_ 

at 


Cluster Incl. AI819948:wj88e11.x1 
Homo sapiens cDNA, 3 end 
/clone=IMAGE-2409932 /clone end=3 
/gb=AI819948 /gi=5439027 
/ug=Hs.5947 /len=569 


MEL 


mel 

transforming 
oncogene 
(derived from 
cell line NK14)- 
RAB8 homolog 


19p13. 
1 


50_ 
at 


Cluster Incl. AB01 1 170:Homo sapiens 
mRNA for KIAA0598 protein, complete 
cds /cds=(581 ,2266) /gb=AB011170 
/gi=3043719 /ug=Hs.6079 /len=4712 


GALN 

AC4S- 

6ST 


B cell RAG 

associated 

protein 


10q26 


353 
55 


Cluster Incl. AB020697:Homo sapiens 
mRNA for KIAA0890 protein, complete 


DDX30 


DEAD/H (Asp- 
Glu-Ala- 


3p21.3 
1 



726 
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at 


cds /cds=(1 43,3727) /gb=AB020697 
/gi=4240268/ug=Hs.6141 /len=3800 




Asp/His) box 
polypeptide 30 


1 


353 

72 

r_at 


Cluster Incl. M17017:Human beta- 
thromboalobulin-like Drotein mRNA, 
complete cds ' /cds=(90,389) 
/gb=M17017 /gi=1 79579 /ug=Hs.624 
/len=1639 


IL8 


interleukin 8 


4q13- 
q21 

• 


356 
38_ 
at 


Cluster Incl. D43638:Human mRNA for 
MTG8a protein, complete cds 
/cds=(41 1 ,21 44) /gb=D43638 
/ai=940399 /ua=Hs 31 551 /len=3460 


CBFA2 
T1 


"core-binding 
factor, runt 
domain, alpha 
subunit 2; 
translocated to, 
1; cyclin D- 
related" 


8q22 


43_ 
at 


Cluster Incl X76732 H saoiens mRNA 
for NEFA protein /cds=(219,1481) 
/gb=X76732 /gi=2706486 /ug=Hs.3164 
/len=1586 


NUCB 

2 


nucleobindin 2 


11p15. 
1-p14 


357 
66_ 
at 


Cluster Incl M26326Human keratin 18 
mRNA, complete cds /cds=(51,1343) 
/gb=M26326 /gi= 186690 /ug=Hs.651 14 
/len=1412 


KRT18 


keratin 18 


12q13 


358 

13 

at 


Cluster Incl. AA192359:zp91c12.s1 
Homo saDiens cDNA, 3 end 
/clone=IMAGE-627574 /clone_end=3 
/gb=AA1 92359 /gi=1781699 
/ug=Hs.69235 /len=715 


TRN- 
SR 


transportin-SR 


7q32.2 
-q32.3 


358 
23_ 
at 


Cluster Incl. M63573: Human secreted 
cyclophilin-like protein (SCYLP) 
mRNA, complete cds /cds=(21,671) 


PPIB 


peptidylprolyl 
isomerase B 
(cyclophilin B) 


15q21- 
q22 
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at 

1 


/gb=M63573 /gi=337998 /ug=Hs.699 
/len=893 




(cyclophilin B) 

i 




359 
19_ 
at 


UlUStBT lilCI. juJUUO.MUinaii 

transcobalamin I mRNA, complete cds 
/cds=(75, 1 376) /gb=J05068 
/gi=307478 /ug=Hs.2012 /len=1537 


TCN1 


transcobalamin 1 
(vitamin B12 
binding protein, 
R binder family) 


11q11- 
q12 


359 
39_ 
s_at 


Cluster Incl. L20433:Human octamer 
binding transcription iaciur i iwinj 
mRNA, complete cds /cds=(234,1496) 
/gb=L20433 /gi=418015 
/ug=Hs.21 1588 /len=3824 


POU4 
F1 


POU domain, 
class 4, 
transcription 
factor 1 


13q21. 
1-q22 


359 
40_ 
at 


Cluster inCI. AO*H3Z*».rl. sapiens inrM^n 

for RDC-1 POU domain containing 
protein /cds=(277,1272) /gb=X64624 
/gi=35914 /ug=Hs.21 1588 /len=3492 


POU4 
F1 


POU domain, 
class 4, 
transcription 
factor 1 


13q21. 
1-q22 


359 
41_ 
f_at 


Cluster Incl. U91329:Human kinesin- 
like motor protein in.it mrsjNi-v, 
complete cds /cds=(1 1 3,3424) 
/gb=U91329 /gi=2738148 
/ug=Hs.211611 /len=4058 


KIF1C 


kinesin family 
member 1 C 


17p13 


359 
95_ 
at 


Cluster Incl. AF067656:Homo sapiens 
Z.W10 interactor t-wuii mrvivii 
complete cds /cds=(24,857) 
/gb=AF067656 /gi=3901271 
/ug=Hs.42650 /len=1639 


ZWINT 


ZW1 0 interactor 


10q21- 
q22 


360 
21_ 
at 


"Cluster Incl. AL049409:Homo sapiens 
mRNA; cDNA DKFZp586H0919 (from 
clone DKFZp586H0919) 
/cds=UNKNOWN /gb=AL049409 
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1 


/gi=4500194 /ug=Hs.44865 /len=1419" 








obU 
95_ 
at 


Homo sapiens cDNA /clone=IMAGE- 
20074 /gb=N99340 /gi=1 270755 
/ug=Hs.7357 /len=1110 


DKFZ 

P586N 

1922 


DKFZP586N192 
2 protein ; 


19q13. 
1 


361 
at 


Cluster Incl. D90144:Homo sapiens 

□6116 TOl LU/O oipiid [Jicouioui , 

complete cds /cds=(86,364) 
/gb=D90144 /gi=2 19905 /ug=Hs.73817 
/len=781 1 


SCYA 
3 


small inducible 
cvtokine A3 
(homologous to 
mouse Mip-1a) 


17q11- 
q21 


361 
23_ 
at 


Cluster inci. uo/^y^.norno sapiens 
mRNA for rhodanese, complete cds 
/cds=(48,941) /gb=D87292 
/gi=1 877030 /ug=Hs.74097 /len=1 137 

i 


I O 1 


thin sulfate 

sulfurtransferas 
e (rhodanese) 


22q13. 
1 


361 
at 


"Cluster Incl. AL050289:Homo sapiens 

-v-dma- /sHMA r\l^ , P"7n^ftfif^n f >99 /from 

clone DKFZp586G0522) 
/cds=(179,1876) /gb=AL050289 
/gi=4886510 /ug=Hs.7446 /len=2364" 


C6orf5 


chromosome 6 
nnpn readino 
frame 5 


6q21 


55_ 
at 


C*\\ icfor \r\r*\ nAVilfi'v Hi imflll mRMA fnr 
L/IUSl©r inci. UO/H'Ou.nuil Idll nirM>in iui 

KIAA0275 gene, complete cds 
/cds=(316,1590) /gb=D87465 
/gi=1665814 /ug=Hs.74583 /len=5316 


KIAA0 

275 


KIAA0275 qene 
product 


1 0pter- 
q25.3 


361 
65_ 

at 


Cluster Incl. W51774:zc48b04.r1 
Homo sapiens cDNA, 5 end 
/clone=IMAGE-325519 /clone_end=5 
/gb=W51774 /gi=1 349666 
/ug=Hs.74649 /len=678 


COX6 
C 


cytochrome c 
oxidase subunit 
Vic 


8q22- 
q23 
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361 
_at 


Y13620 /FEATURE= 
/DEFINITION=HSRNABCL9 Homo 
sapiens mRNA for BCL9 gene 


BCL9 


B-cell 

CLL/lymphoma 
9 


1q21 


362 
39_ 
at 


Cluster Incl. Z49194:H.sapiens mRNA 
for oct-binding factor /cds=(523,1293) 
/gb=Z49194 /gi=974830 /ug=Hs.2407 
/len=3301 


POU2 
AF1 


POU domain, 
class 2, 
associating 
factor 1 


11q23. 
1 

• 


364 
64_ 
at 


Cluster Incl X94323:H. sapiens mRNA 
for SGP28 protein /cds=(40,777) 
/gb=X94323 /gi=1213612 
/ug=Hs.54431 /len=2124 


SGP28 


specific granule 
protein (28 kDa) 


6p12.3 


365 
36_ 
at 


Cluster Incl AF070614'Homo sapiens 
clone 24732 unknown mRNA, partial 
cds /cds=(0,1147) /gb=AF070614 
/gi=3283878 /ug=Hs.61490 /len=1734 


SCHIP 
1 


schwannomin- 
interacting 
protein 1 


3q26.1 


365 

53 

at 


Cluster Incl. AA669799:ag36c04.s1 
Homo saDiens cDNA 3 end 
/clone=IMAGE-1 1 1 8886 /clone_end=3 
/gb=AA669799 /gi=2631298 
/ug=Hs.6315/len=679 


ASMT 
L 


acetylserotonin 
O- 

methyltransfera 
se-like 


Xp22.3 


365 
71_ 
at 


Cluster Incl X68060:H. sapiens topi lb 
mRNA for topoisomerase lib 
/cds=(0,4865) /gb=X68060 /gi=37230 
/ug=Hs.75248 /len=4866 


TOP2 
B 


topoisomerase 
(DNA) II beta 
(180kD) 


3p24 


365 
88_ 
at 


Cluster Incl. AB018353:Homo sapiens 
mRNA for KIAA0810 protein, partial 
cds /cds=(0,2475) /gb=AB018353 
/gi=3882340 /ug=Hs.7531 /len=4047 


KIAAO 
810 


KIAA0810 
protein 
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366 

Oil. 
at 


Cluster inci. ivioojuo.nurndii vincuun 
mRNA, complete cds /cds=(50,3250) 
/gb=M33308 /gi=340236 /ug=Hs.75350 
/len=5102 


\/c\ 

VOL- 


vinculin 

VII 1 1 1 1 1 

1 


1 0q22. 
1-q23 


366 
07_ 
at 


Cluster inci. ^.yy/ lo.urv/iiouLj iu.a 
(alpha-N-acetylgalactosaminidase) 
/cds=(472,1707) /gb=Z99716 
/ g i=4456457 /ug=Hs.75372 /len=3606 








366 
08_ 
at 


Cluster Inci. D55654:Human mRNA for 
cytosolic maiate aenyarogenase, 
complete cds /cds=(55,1059) 
/gb=D55654 /gi=1 255603 
/ug=Hs.75375 /len=1 267 


MDH1 


maiate 

Hphx/H ronenase 
1 , NAD (soluble) 


2p16 


366 
29_ 
at 


Cluster Inci. AI635895:tz82a07.x1 

i 

Homo sapiens cDNA, o ena 
/clone=IMAGE-2295060 /clone_end=3 
/gb=AI635895 /gi=4687225 
/ug=Hs.75450 /len=1082 


DSIPl 


delta sleep 

II iuuwii iy 

peptide, 
immunoreactor 


Xp21.1 
-q25 


366 
38_ 
at 


Cluster Inci. A/oy4/.rt. sapiens mKiNM 
for connective tissue growth factor 
/cds=(1 45, 1 1 94) /gb=X78947 
/gi=474933 /ug=Hs.75511 /len=2312 


o i on 


tissue growth 
factor 


6q23.1 


366 
44_ 
at 


Cluster inci. u^yyoo.norno sapiens 
mRNA for CD151, complete cds 
/Cds=(84,845) /gb=D29963 
/gi=2073384 /ug=Hs.75564 /len=1486 




CD151 antioen 


11p15. 
5 


366 
61_ 
s_at 


Cluster Inci. X06882:Human gene for 
CD14 differentiation antigen 
/cds=(105,1232) /gb=X06882 
/gi=29736 /ug=Hs.75627 /len=1356 


CD14 


CD14 antigen 


5q31.1 
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/gi=29736 /ug=Hs.75627 /len=1356 








366 

Of 

at 


Cluster Incl. U47025:Human fetal brain 
nk/rr»np>n nhosnhorvlase B mRNA. 

UlywvMvl 1 uiiuvui ivl vmwv *—* ■■■■ » ■ . ' * j 

complete cds /cds=(34,2565) 
/gb=U47025 * /gi=1 172225 
/ug=Hs.75658 /len=4055 


PYGB 


"phosphorylase, 
glycogen; brain" 


20p11. 
2- 

p11.1 

• 


366 
an 

at 


Cluster Incl. M1 0901 .Human 

oh \fr\mrt\m\c\ r^r^pntox alDha mRNA 

complete cds /cds=(1 32,2465) 
/gb=M10901 /gi=1 83032 /ug=Hs.75772 
/len=4788 


NR3C 
1 


nuclear receptor 
subfamily 3, 
group C, 
member 1 


5q31 


09_ 
at 


r^inctor inH Y0nflQ3H saDiens mRNA 
for leukocyte adhesion glycoprotein 
p1 50,95 /cds=(64,3555) /gb=Y00093 
/gi=35175 /ug=Hs.51077 /len=4654 


ITGAX 


integrin, alpha X 
(antigen CD11C 
(pi 50), alpha 
polypeptide) 


16p11. 
2 


10_ 
at 


r^inctor inri 7^809fi*H ^aniens mRNA 
for FALL-39 peptide antibiotic 
/cds=(1 1 ,523) /gb=Z38026 /gi=558378 
/ug=Hs.51120 /len=615 

i 


CAMP 


cathelicidin 

antimicrobial 

peptide 


3p21.3 


OD / 

49_ 
at 


Pinct^r inri M7T720"Human mast cell 
carboxypeptidase A (MC-CPA) gene 
/cds=(1 1 , 1 264) /gb=M73720 
/gi=1 87441 /ug=Hs.646 /len=1633 








367 
66_ 
at 


Cluster Incl. X55988:Human EDN 
mRNA for eosinophil derived 
neurotoxin /cds=(71 ,556) /gb=X55988 
/gi=31088 /ug=Hs.728 /len=735 


RNAS 
E2 


ribonuclease, 
RNase A family, 
2 (liver, 
eosinophil- 
derived 
neurotoxin) 


14q24- 
q31 
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367 
73_ 
f_at 


Cluster Incl. M81 141 -.Human MHC 
class II HLA-DQ-Deta mKNA ^ljk/ 
DQw2), complete cds /cds=(35,820) 
/gb=M81141 /gi=1 88202 /ug=Hs.73933 
/len=1171 


HLA- 


major 

hi^tnpomnatihilit 

1 IIOlvvVI 1 1 k-* Q LI VJ MIL 

y complex, class 
II, DQ beta 1 


6p21.3 


367 
90_ 
at 


Cluster Incl. Miyzo/.numan 
tropomyosin mRNA, complete cds 
/cds=(286,1140) /gb=M19267 
/gi=339943 /ug=Hs.77899 /len=1633 




li v|JUi i lyuou i i 

(alpha) 


15a22 

1 


367 
98_ 

g_a 

t 


Cluster Incl. J041 68. Human leuKosiann 
mRNA, complete cds /cds=(95,1297) 
/gb=J041 68 /gi=1871 1 8 /ug=Hs.80738 
/len=2288 


cdm 
orIN 


oidiufji iui ii i 

(9PL115, 

leukosialin, 

CD43) 


16d1 1 
2 


368 
02_ 
at 


Cluster Incl. M23197:Human 

i 

differentiation antigen (CD33) mKNA, 
complete cds /cds=(12,1 106) 
/gb=M23197 /gi=1 80097 /ug=Hs.83731 
/len=1437 


CD33 


CD33 antigen 
19 p°' ) 


19q13. 


368 
09_ 
■ at 


Cluster Incl. L01664:Human eosinophil 
Charcot-Leyden crystal (OLU) protein 
(lysophospholipase) mRNA, complete 
cds /cds=(33,461) /gb=L01664 
/gi=1 87273 /ug=Hs.889 /len=586 


CLC 


Charot-Leyden 


19q13. 


368 
43_ 
at 


Cluster Incl. AB005666:Homo sapiens 
mRNA for GTPase-activating protein, 
complete cds /cds=(296,3424) 
/ab=AB005666 /gi=2389008 
/ug=Hs.7019/len=3885 


SIPA1 


signal-induced 

nrolif^ratinn- 

yj \ \j\ 1 1 ci auui i 

associated gene 
1 


11q13. 

3 


368 
73 


Cluster Incl. D16532:Human gene for 
very low density lipoprotein receptor, 
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at 


5flanking and /cds=(61 5,3236) 
/gb=D16532 /gi=407220 /ug=Hs.73729 
/len=3853 








368 

78 

f_at 


Cluster Incl. M60028: Human MHC 
class II HLA-DQ-beta (DQB1 ,DQw9), 
complete cds /cds=(57,842) 
/gb=M60028 /gi=188114 /ug=Hs. 73931 
/len=1192 


HLA- 
DQB1 


major 

histocompatibilit 
y complex, class 
II, DQ beta 1 


6p21.3 

• 


368 
81_ 
at 


Cluster Incl. X71 129:H.sapiens mRNA 
for electron transfer flavoprotein beta 
subunit /cds=(27,794) /gb=X71129 
/gi=297901 /ug=Hs.74047 /len=835 


ETFB 


electron- 
transfer- 
flavoprotein, 
beta polypeptide 


19q13. 
3 


368 
94_ 
at 


Cluster Incl. AL031846:dJ742C19.5 
(novel Chromobox protein) 
/cds=(89,844) /gb=AL031846 
/gi=41 64368 /ug=Hs.7442 /len=3964 








369 

Www 

00_ 
at 


Cluster Incl. U52426:Homo sapiens 
GOK (STIM1) mRNA, complete cds 
/cds=(565,2622) /gb=U52426 
/gi=2264345 /ug=Hs.74597 /len=4040 . 


STIM1 


stromal 
interaction 
molecule 1 


11p15. 
5 


369 

W w v/ 

36_ 
at 


Cluster Incl. U58766:Human FX 
protein mRNA, complete cds 
/cds=(74,1039) /gb=U58766 
/gi=1381178 /ug=Hs.75801 /len=1330 


TSTA3 


tissue specific 
transplantation 
antigen P35B 


8q24.3 


369 
45_ 
at 


Cluster Incl. X94910:Homo sapiens 
mRNA for ERp28 protein 
/cds=(1 1 ,796) /gb=X9491 0 
/gi=3413292 /ug=Hs.75841 /len=892 


C12orf 
8 


chromosome 12 
open reading 
frame 8 


12q24. 
13 



734 

BNSDOCID: <W O Q3039443A2 I > 



WO 03/039443 



^PCT/EP02/12303 



369 
52; 
at 


Cluster Incl. D16480:Homo sapiens 
mRNA for mitochondrial enoyl-CoA 
hydratase/3-hydroxyacyl-CoA 
dehydrogenese alpha-subunit of 
trifunctional protein, complete cds 
/cds=(27,2318) /gb=D16480 
/gi=493657 /ug=Hs.75860 /len=2690 


HADH 
A 


hydroxyacyl- 

Coenzyme A 

dehydrogenase/ 

3-ketoacyl- 

Coenzyme A 

thiolase/enoyl- 

Coenzyme A 

hydratase 

(trifunctional 

protein), alpha 

subunit 


2p23 


369 
63_ 
at 


Cluster Incl. U30255:Human 
phosphogluconate dehydrogenase 
(hPGDH) gene, complete cds 
/cds=(6,1457) /gb=U30255 /gi=984324 
/ug=Hs.75888/len=1536 , 


PGD 


phosphoglucona 
te 

dehydrogenase 


1p36.3 
P36.13 


369 
73_ 
at 


Cluster Incl. U41371:Human 
spliceosome associated protein (SAP 
145) mRNA, complete cds 
/cds=(48,2666) /gb=U41371 
/gi=1 173904 /ug=Hs.75916 /len=2820 


SF3B2 


splicing factor 
3b, subunit 2, 
145kD 


11q13. 
1 


369 
80_ 
at 


Cluster Incl. U03105:Human B4-2 
protein mRNA, complete cds 
/cds=(113,1096) /gb=U03105 
/gi=476094 /ug=Hs.75969 /len=2061 


B4-2 


proline-rich 
protein with 
nuclear 

targeting signal 


6q16.1 


369 
96_ 
at 


Cluster Incl. U41635:Human OS-9 
precurosor mRNA, complete cds 
/cds=(85,2088) /gb=U41635 
/gi=1 322233 /ug=Hs.76228 /len=2736 


OS-9 


amplified in 
osteosarcoma 


12q13 


370 
01_ 


Homo sapiens /REF=M23254 
/DEF=Cluster Incl. :Human Ca2- 


CAPN 
2 


calpain 2, (m/ll) 
large subunit 


1q41- 
q42 
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at 


activated neutrai protease icnye 
subunit (CANP) mRNA, complete cds 
/cds=(1 30,2232) /gb= /gi=511636 
/ug=Hs.76288 /len=3213 /LEN=3435 


2 


large subunit 


q42 


1 370 
15_ 
at 


Cluster Incl. KUoUUU.numan aiaenyue 
dehydrogenase 1 mRNA /cds=(0, 1022) 
/gb=K03000 /gi= 178399 /ug=Hs.76392 
/len=1560 


ALDH1 
A1 


aldehvde 
dehydrogenase 
1 family, 
member A1 


9q21 

• 


370 
18_ 
at 


Cluster Incl. AI189287:qd05c04.x1 
Homo sapiens cUNA, o ena 
/clone=IMAGE-1 722822 /clone_end=3 
/gb=A1 1 89287 /gi=3740496 
/ug=Hs.7644 /len=738 


H1F2 


H1 histone 
familv member 
2 


6p21 .3 


I 370 
21_ 
at 


Cluster Incl. Xloooz. Human mruNM ror 
cathepsin H (EC 3.4.22.16) 
/cds=(34,1041) /gb=X16832 /gi=29709 
/ug=Hs.76476 /len=1399 


CTSH 


catheosin H 

i 


15q24- 
q25 


370 
23_ 
at 


Cluster Incl. J02923: Human 65- 
kilodalton pnospnoprotein ipoa; 
mRNA, complete cds /cds=(74,1957) 
/gb=J02923 /gi=1 89501 /ug=Hs.76506 
/len=3175 


LCP1 


lymphocyte 
cvtosolic orotein 
1 (L-plastin) 


13q14. 
3 


370 
26_ 
at 


Cluster Incl. AF001461 :Homo sapiens 
Kruppel-liKe zinc Tinger proiem ^.ij? 
mRNA, complete cds /cds=(30,881) 
/gb=AF001461 /gi=3378030 
/ug=Hs.76526 /len=1354 


COPE 

R 


core promoter 
element bindinq 
protein 


10p15 


370 
27_ 


Cluster Incl. M80899:Human novel 
protein AHNAK mRNA, partial 
sequence /cds=(0,3835) /gb=M80899 


AHNA 
K 


AHNAK 

nucleoprotein 

(desmoyokin) 


11q12- 
q13 
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at 


/gi=1 78282 /ug=Hs.76549 /!en=4051 




(desmoyokin) 




370 
29_ 
at 


Homo sapiens /REF=X83218 
/DEF=Cluster Incl. :H.sapiens mRNA 
for ATP synthase /cds=(36,677) /gb= 
/gi=1 008079 /ug=Hs.76572 /len=750 
/LEN=826 


ATP5 
O 


ATP synthase, 
H+ transporting, 
mitochondrial 
F1 complex, O 
subunit 

fnlinomvcin 

sensitivity 
conferring 
protein) 


21q22. 
11 


370 
39_ 
at 


Cluster Incl. J00194:human nia-ar 
antigen alpha-chain mrna & ivs 
fragments /cds=(26,790) /gb=J00194 
/gi=188231 /ug=Hs.76807 /len=1 199 


nLA- 
DRA 


histocompatibilit 
y complex, class 
II, DR alpha 


6p21.3 


370 
54_ 
at 


Cluster Incl. J04739:Human 
bactericidal permeability increasing 
protein (BPI) mRNA, complete cds 
/cds=(30,1493) /gb=J04739 
/gi=1 79528 /ug=Hs.89535 /len=1813 


BPI 


bactericidal/per 

m oohi i it\/- 
1 1 ICaUIMLy 

increasing 
protein 


20q11. 
23-q12 


370 
99_ 
at 


Cluster Incl. AI806222:wf26e10.x1 
Homo sapiens cDNA, 3 ena 
/clone=IMAGE-2356746 /clone_end=3 
/gb=AI806222 /gi=5392788 
/ug=Hs. 1 001 94 /len=563 


ALOX5 

AP 

■ 


arachidonate 5- 
linnYvnpnase- 

activating 
protein 


13q12 


371 

*r 1 

at 


Cluster Incl. AF020044:Homo sapiens 
Ivmnhocvte secreted C-type lectin 
precursor, mRNA, complete cds 
/cds=(179,1150) /gb=AF020044 
/gi=2828595 /ug=Hs. 105927 /len=1391 


SCGF 


"stem eel 
growth factor; 
lymphocyte 
secreted C-type 
lectin" 


19q13. 
3 
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371 
49_ 
s_at 


Cluster Incl. U95626:Homo sapiens 
ccr2b (ccr2), ccr2a (ccr2), ccr5 (ccr5) 

orir4 r*r*rfc {nf*r&\ npnpQ COfTIDlst© COS 
3nQ CClD ^oUluy yeiico, V/UNipiwi^ wuo, 

and lactoferrin (lactoferrin) gene, 
partial cds /cds=(2,1429) /gb=U95626 
/gi=21 04517 /ug=Hs. 105938 /len=1607 






• 


371 
59_ 
at 


r^ino+cir in^i I i7Q9^Q , Human clone 
23945 mRNA, complete cds 
/cds=(636,1403) /gb=U79259 
/gi=1710213 /ug=Hs.10700 /len=1683 


DJ159 
A19.3 


hypothetical 
protein 


1p36.1 
3 


372 
63_ 
at 


Cluster Incl. U55206:Homo sapiens 
human gamma-glutamyl hydrolase 
(hGH) mRNA, complete cds 
/cds— (c>y, 1 un o) /yu ujjiuu 
/gi=2957143 /ug=Hs.78619 /len=1265 


GGH 


gamma- 
glutamyl 
hydrolase 
(coniuqase, 
folylpolygamma 
glutamyl 
hydrolase) 


8q12.1 


373 

11_ 
at 


Cluster Incl. AF010400:untitled 
/cds=(50,1063) /gb=AF010400 
/gi=2612878 /ug=Hs.77290 /len=1242 








373 
26_ 
at 


"Cluster Incl. U93305:Homo sapiens 
A4 differentiation-dependent protein 
(A4), triple LIM domain protein (LM06), 
and synaptophysin (SYP) genes, 

oAinnlatn rHc onH f*p|piljm channel 

alpha-1 subunit (CACNA1F) gene, 
partial cds /cds=(75,533) /gb=U93305 
/gi=270759" 








373 
33_ 
at 


Cluster Incl. X63692:H.sapiens mRNA 
for DNA (cytosin-5)-methyltransferase 
/cds=(237,5087) /gb=X63692 


DNMT 
1 


DNA (cytosine- 
5-)- 

methyltransfera 


19p13. 
2 
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at 

* 


/gi=1632818 /ug=Hs.77462 /len=5408 




se 1 




373 
44_ 
at 


Cluster Incl. X62744:Human RING6 
mRNA for HLA class II alpha chain-like 
product /cds=(45,830) /gb=X62744 
/gi=36062 /ug=Hs.77522 /len=1079 


HLA- 
DMA 


maior 

III CJJ \J 1 

histocompatibilit 
y complex, class 
II, DM alpha 


D(J^ 1 . 0 


373 
84_ 
at 


Cluster Incl. D13640:Human mRNA for 
KIAA0015 gene, complete cds 
/cds=(1 06,1470) /gb=D13640 
/gi=286006 /ug=Hs.77961 /len=5134 


KIAAO 
015 


KIAA001 *5 npnp 

product 


22 


373 
99_ 
at 


Cluster Incl. D 17793: Human mRNA for 
KIAA0119 gene, complete cds 
/cds=(51,1022) /gb=D17793 
/ai=457407 /ua=Hs 78183 /len=1204 


AKR1 
C3 


aldo-keto 
reductase family 
1 , member C3 
^o-ci»pna 
hydroxysteroid 
dehydrogenase, 
type II) 


10p15- 
p14 


374 
03_ 
at 


Cluster Incl. X05908:Human mRNA for 
lipocortin /cds=(74,1114) /gb=X05908 
/gi=34387 /ug=Hs.78225 /len=1399 


ANXA 
1 


annexin A1 


9q12- 
q21.2 


374 
07_ 
s_at 


Cluster Incl. AF01 3570: Homo sapiens 
smooth muscle myosin heavy chain 
SM2 mRNA, alternatively soliced 
partial cds /cds=(0,1767) 
/gb=AF013570 /gi=2352944 
/ug=Hs.78344 /len=2580 


MYH1 
1 


myosin, heavy 
polypeptide 1 1 , 


16p13. 
13- 

|J IO. I £- 


374 
08_ 
at 


Cluster Incl. AB014609:Homo sapiens 
mRNA for KIAA0709 protein, complete 
cds /cds=(1 16,4555) /gb=AB014609 
/gi=3327231 /ug=Hs.7835 /len=5641 


KIAAO 
709 


endocytic 
receptor 
(macrophage 
mannose 
receptor family) 


17q24. 
1 
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receptor family) 




374 
17_ 
at 


Cluster Incl. M36542: Human lymphoid- 
specific transcription idvAui mrvi^r^, 
complete cds /cds=(54,1445) 
/gb=M36542 /gi=339495 /ug=Hs.1101 
/len=2048 


POU2 
F2 


POU domain, 
class 2, 
transcription 
factor 2 


19q13. 
31 


375 
79_ 
at 


Cluster inci. u^f/ / oo.nuiuu ° a H io,w 
inducible protein mRNA, complete cds 
/cds=(1004,1714) /gb=L47738 
/gi=1 009098 /ug=Hs.80313 /len=2881 


PIR12 
1 


cytoplasmic 
FMRP 
interacting 
protein 2 


5q34 


376 
00_ 
at 


Cluster Incl. U68186:Human 
extracellular mainx proieui i mrvi™-*, 
complete cds /cds=(1 03,1 725) 
/gb=U68186 /gi=2660683 
/ug=Hs.81071 /len=1819 


ECM1 


extracellular 
matrix protein 1 

* 


1q21 


I 376 
15_ 
at 


Cluster Incl. uooyo<i.nuiTicin mruN/i iui 
KIAA0207 gene, complete cds 
/cds=(781 ,2547) /gb=D86962 
/gi=1 503997 /ug=Hs.81875 /len=5431 


GRB1 
0 


growth factor 
receptor-bound 
protein 10 


7p12- 
p11.2 


376 
25_ 
at 


Cluster Incl. U52682:Human 
lymphocyte specific interferon 
regulatory Tscior/inreiiciun icyuiawij 
factor 4 (LSIRF/IRF4) mRNA, complete 
cds /cds=(125,1477) /gb=U52682 
/gi=1378108 /ug=Hs.82132 Vlen=5320 


IRF4 


interferon 
regulatory factor 
4 


6p25- 
p23 


376 
57_ 
at 


Cluster Incl. Y16270:Homo sapiens 
PALM gene, exon 1 and joined CDS 
/cds=(145,1308) /gb=Y1627C 
/gi=3219601 /ug=Hs.78482 /len=2823 


I 
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376 
69_ 
s_at 


Cluster Incl. U16799:Human Na,K- 

atd^cq ci iKi i nit rnRNA 

complete cds /cds=(45,950) 
/gb=U 16799 /gi=806753 /ug=Hs.78629 
/len=1476 


ATP1B 
1 


ATPase, 
Na+/K+ 
transporting, 
beta 1 
polypeptide 


1q22- 
q25 


377 
01_ 
at 


Cluster Incl. L13463:Human helix-loop- 
neiix Dasic pnospnopi oicii i 
mRNA, complete cds /cds=(32,667) 
/gb=L13463 /gi=292054 /ug=Hs.78944 
/len=1345 


RGS2 


regulator of G- 
protein 

signalling 2, 
24kD 


1q31 


377 
39_ 
at 


Cluster Incl. M86737:Human high 
mobility group dox ^oor^r'i; mruw, 
complete cds /cds=(274,2403) 
/gb=M86737 /gi=1 84241 /ug=Hs.79162 
/len=2825 


SSRP 
i 


structure 
soecific 
recognition 
protein 1 


11q12 


377 
47_ 
at 


Cluster incl. uuo/ /U.nurnan anneAiii v 
(ANX5) gene /cds=(164,1126) 
/gb=U05770 /gi=2182176 
/ug=Hs.79274 /len=1597 








377 
55_ 
at 


OlUSter inCL AdUZj I oy.nui 1 lu ^dj-Mdio 

mRNA for KIAA0952 protein, complete 
cds /cds=(359,1927) /gb=AB023169 
/gi=4589547 /ug=Hs.7935 /len=4856 


KIAAO 
952 


KIAA0952 
protein 


20p12. 
1 


377 
62_ 
at 


UlUSter inCI. TU/ »U».n.sajjidi» hiring 

for Progression Associated Protein 
/cds=(21 8,691 ) /gb=Y07909 
/gi=1 542882 /ug=Hs.79368 /len=2774 


EMP1 


epithelial 
membrane 
protein 1 


12p12. 
3 


377 
I 75 


Cluster Incl. W25951:17d10 Homo 
sapiens cDNA /gb=W25951 




septin 6 


Xq24 
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/gi=1 306236 /ug=Hs. 123282 /len=738 








09_ 
at 


pinctor inri U41813"Human class I 
homeoprotein (HOXA9) mRNA, partial 
cds /cds=(0,389) , /gb=U41813 
/gi=1 184168 /ug=Hs.1 27428 /len=141 1 


HOXA 
9 


homeo box A9 


7p15- 
p14 


378 
11_ 
at 


Cluster Incl. AF042792:Homo sapiens 
alpha 2 delta calcium channel subunit 
; CA f nrm 1 mRNA comolete cds 
/cds=(1 61 ,3598) /gb=AF042792 
/gi=2781438 /ug=Hs.127436 /len=5463 


CACN 
A2D2 


calcium 
channel, 
voltage- 
dependent, 
alpha 2/delta 
subunit 2 


3p21.3 


378 
9 /_ 
s_at 


Cluster Incl. AI985964:wr79d08.x1 
u A mA c2ani<an^ nDNA 3 end 
/clone=IMAGE-2493903 /clone_end=3 
/gb=AI985964 /gi=5813241 
/ug=Hs.82961 /len=487 


TFF3 


trefoil factor 3 
(intestinal) 

• 


21q22. 
3 


379 
at 


Cluster Incl. D14520:Human mRNA for 
<—•/-> Rrtv hinHinn Drotein BTEB2, 
complete cds /cds=(558,1217) 
/gb=D14520 /gi=303596 /ug=Hs.84728 
/len=1301 


KLF5 


Kruppel-like 
factor 5 
(intestinal) 


13q21. 
32 


54_ 
at 


mnetor inri X1 fi662 - Human mRNA for 
vascular anticoagulant-beta (VAC- 
beta) /cds=(1 06, 1 089) /gb=X1 6662 
/gi=37638 /ug=Hs.87268 /len=1940 


ANXA 
8 


annexin A8 


10q11. 
2 


379 
67_ 
at 


Cluster Incl. AF000424:Homo sapiens 
LST1 mRNA, cLSTI/C splice variant 
complete cds /cds=(225,500 
/qb=AF000424 /gi=21 45063 


LY117 


lymphocyte 
antigen 117 


6p21 .3 
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l 

1 


fug=Hs.8841 1 /len=635 








379 
s_at 


Cluster Incl. AI436567:ti03b09.x1 

Uai^a eonionc r*nM A 3 ©fid 

Homo sapiens uuiNn, ^ 
/clone=IMAGE-21 29369 /clone_end=3 
/gb=AI436567 /gi=4282731 
/ug=Hs.89761 /len=680 


ATP5D 


ATP synthase, 
H+ transporting, 
mitochondrial 
F1 complex, 
delta subunit 


19p13. 
3 


380 
06_ 
at 


rM. .Mar in/^i iwi^wfifi* Human MEM-102 

L/lUSt6r InCI. IVlOf /QO.nuiiian ivii— ivi 

glycoprotein mRNA, complete cds 
/cds=(20,751) /gb=M37766 /gi=187518 
/ug=Hs.901 /len=1058 


CD48 


CD48 antigen 
(B-cell 
membrane 
protein) 


1q21.3 
-q22 


380 
17_ 
at 


CIust6r inci. uuozoy.nui ncm ivid i 
gene, complete cds /cds=(36,716) 
/gb=U05259 /gi=452561 /ug=Hs.79630 
/len=1107 








380 
37_ 
at 


Cluster Incl. M60278:Human heparin- 
binding EGF-like growth factor mRNA, 
complete cds /cds=(26 1,887) 

/«u-iv/icno7Q /ni— 1 ft^ftfifi /nn=Hs799 
/gD=IViDU^/0 /gi- loooou /uy i 10./^%/ 

/Ien=2342 


DTR 


diphtheria toxin 
receptor 
(heparin-binding 
epidermal 
growth factor- 
like growth 
factor) 


5q23 


380 
63_ 
at 


Cluster Incl. U00952:Human clone 
AQAODDR7 /r^AP^n/fCVTG^n reoeat- 
containing mRNA /cds=UNKNOWN 
/gb=U00952 /gi=405054 /ug=Hs.8068 
/len=1047 








380 
95 J 
_at 


Cluster Incl. M83664:Human MHC 
class II lymphocyte antigen (H LA-DP) 
beta chain mRNA, complete cds 
/cds=(59,835) /gb=M83664 /gi=1 88478 


HLA- 
DPB1 


major 

histocompatibilit 
y complex, class 
II, DP beta 1 


6p21 .3 



743 



BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 





0ig=Hs.814/len=1501 




I, DP beta 1 




380 
96_ 
f_at 


Cluster Incl. M83664:Human MHC 
class ii lympnocyie anuycii v. r,, - r ^ vjx > 
beta chain mRNA, complete cds 
/cds=(59,835) /gb=M83664 /gi=1 88478 
/ug=Hs.814/len=1501 


HLA- 
DPB1 


major 

histocompatibilit 
y complex, class 
II, DP beta 1 


6p21.3 


380 
97_ 
at 


ClUSt©r inCI. nrU IUO lO.nuiMU oapi^iKJ 

Pig8 (PIG8) mRNA, complete cds 
/cds=(72, 1 028) /gb=AF01 031 3 
/gi=2415301 /ug=Hs.8141 /len=2165 


PIG8 


etoposide- 
induced mRNA 


11q24 


381 
12_ 

9_a 
t 


"Cluster Incl. X15998:H.sapiens mRNA 
for the chondroitin sulphate 
proteoglycan versican, v i »p»o«= 
variant; precursor peptide 
/cds=(266,7495) /gb=X15998 
/gi=37662 /ug=Hs.81800 /len=8224" 


CSPG 
2 


chondroitin 
sulfate 

proteoglycan 2 
(versican) 


5q14.3 


I 381 
16_ 
at 


/m,,»i,, i _i r\-i y|CC"7-i-ii iman mRNA for 

Cluster Incl. ui4oo/.riurneiii mru>M-i iwi 
KIAA0101 gene, complete cds 
/cds=(61 ,396) /gb=D14657 /gi=285938 
/ug=Hs.81892/len=836 


KIAA0 
101 


KIAA0101 gene 
product 


15q22. 
1 


381 
47_ 
at 


Cluster Incl. AL023657:Homo sapiens 
SH2D1A cDNA, formerly known as 

noun uj-_/')QQ coc\ /nh=AI 023657 

/gi=3153107 /ug=Hs.151544 /len=2530 


SH2D1 
A 


SH2 domain 
protein 1A, 
Duncan's 
disease 

(lymphoprolifera 
tive syndrome) 


Xq25- 
q26 


381 
94_ 
s_a1 


Cluster Incl. M63438:Human Ig 
rearranged gamma chain mRNA, V-J- 
: C region and complete cds 
/cds=(0.1049) /gb=M63438 /gi=1 84847 


IGKC 

T 


immunoglobulin 
kappa constant 


2p12 
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fug=Hs.156110/len=1244 




i 




382 
25_ 
at 


Cluster Incl. AF052728:Homo sapiens 
HERG-USO (HERG) mRNA, 
alternatively spucea, panidi wj=> 
/cds=(0,284) /gb=AF052728 
/gi=3549258 /ug=Hs. 165664 /len=767 


KCNH 
2 


potassium 
voltage-gated 
channel, 
subfamily H 
(eag-related), 
member 2 


7q35- 
q36 


382 
33_ 
at 


Cluster Incl. Aruyozoo.nomu s>dpicu» 
homer-3 mRNA, complete cds 
/cds=(90,1 1 66) /gb=AF093265 
/gi=3834620 /ug=Hs.166146 /len=1407 


HOME 
R-3 


Homer, 
neuronal 
immediate early 
gene, 3 


19p13. 
12 


382 
69_ 
at 


"Cluster Incl. AL050147:Homo sapiens 
mRNA; cDNA DKrZpoobtuo^u tjrom 
clone DKFZp586E0820) /cds=(0,1630) 
/gb=AL050147 /gi=4884153 
/ug=Hs.91 146 /len=1837" 


PKD2 


protein kinase 
D2 


19q13. 
2 


383 
12_ 
at 


"Cluster Incl. AL050002:Homo sapiens 
mRNA; CDNA UKrZ.poo4»_>zzz 
done DKFZp5640222) 
/cds=UNKNOWN /gb=AL050002 
/gi=4884256 /ug=Hs.94795 /len=1546" 








383 
19_ 
at 


Cluster Incl. AA919102:ol84h02.s1 
Homo sapiens cuinm, o c,|U 
/clone=IMAGE-1 536339 /clone_end=3 
/gb=AA919102 /gi=3058992 
/ug=Hs.95327 /len=622 


CD3D 


CD3D antigen, 
delta 

polypeptide 
(TiT3 complex) 


11q23 


383 
36_ 
at 


Cluster Incl. AB023230:Homo sapiens 
mRNA for KIAA1013 protein, partial 
cds /cds=(0,3188) /gb=AB023230 
/gi-4589675 /ug=Hs.96427 /len=4783 


KIAA1 
013 


KIAA1013 
protein 


3 
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/gi=4589675 /ug=Hs.96427 /len=4783 








383 
61_ 

9_a 
t 


Cluster Incl. AI688812:wd41c03.x1 

Ul^rv-ii^ canipnc /""DMA 3 6110 

/clone=IMAGE-2330692 /clone_end=3 
/gb=AI688812 /gi=4900106 
/ug=Hs.99491 /len=504 


RASG 
RP2 


RAS guanyl 
releasing 
protein 2 
(calcium and 
DAG-regulated) 


11q13 

• 


383 
91_ 

at 


Cluster Incl. M94345:Homo sapiens 
macropnage cappiny pn-Hcm ii"^^! 
complete cds /cds=(49,1095) 
/gb=M94345 /gi=1 87455 /ug=Hs.82422 
/len=1221 


CAPG 


capping protein 
(actin filament), 
gelsolin-like 


2cen- 
q24 


384 

A A 

14_ 
at 


Cluster Incl. U05340: Human p55CDC 
mKiNA, complete ouo /ouo • ^» » w+s; 
/gb=U05340 /gi=468031 /ug=Hs.82906 
/len=1686 


CDC2 
0 


CDC20 (cell 
division cycle 
20, S. 
cerevisiae, 
homolog) 


9q13- 
q21 


384 

A C 

1 5_ 
at 


Cluster Incl. U14603:Human protein- 
tyrosine pnospncucioe \> ■ 1 *j 
mRNA, partial sequence 
/cds=(423,926) /gb=U 14603 
/gi=894158 /ug=Hs.82911 /len=1526 


PTP4A 

2 


protein tyrosine 
phosphatase 
type IVA, 
member 2 


1p35 


384 
35_ 

at 


Cluster Incl. U25182:Human 
antioxiQani enzyme rvw'i— o i mm 
complete cds /cds=(43,858) 
/gb=U25182 /gi=799380 /ug=Hs.83383 
/len=921 


PRDX 
4 


peroxiredoxin 4 


Xp22.1 
3 


384 
65_ 
at 


Cluster Incl. M37721:Human 
peptidylglycine alpha-amidating 
monooxygenase mRNA, complete cds 
/cds=(1 88,31 1 2) /gb=M37721 


PAM 


peptidylglycine 

alpha-amidating 

monooxygenase 


5q14- 
q21 
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/gi=1 89594 /ug=Hs.83920 /len=3748 




i 




384 
72_ 
at 


Cluster inci. udj4( /.nurndii mrvi^v-v iui 
KIAA0143 gene, partial cds 
/cds=(0,2658) /gb=D63477 
/gi=1 469867 /ug=Hs.84087 /len=5286 


KIAA0 
143 


KIAA0143 
protein 


8q24.2 
2 


384 
85_ 
at 


Cluster Incl. AA760866:nz14h07.s1 
Homo sapiens cDNA /clone=IMAGE- 

128/OUO /gD— Anl OUODD /yi — 'Ou" 

/ug=Hs.84549 /len=553 


NDUF 
C1 


NADH 

dehydrogenase 
(ubiquinone) 1 , 
subcomplex 
unknown, 1 
(6kD, KFYI) 


4q28.2 
-q31.1 


384 
87_ 
at 


Cluster Incl. Uo/4oo.numan mr\iNM iui 
KIAA0246 gene, partial cds 
/cds=(0,6639) , /gb=D87433 
/gi=1 665760 /ug=Hs.84753 /len=6777 




• 




385 
77_ 
at 


Cluster Incl. uy^yui.nomo sapiens 
clone DT1P1B6 mRNA, CAG repeat 
region /cds=UNKNOWN /gb=U92981 
/gi=2781399 /ug=Hs.18081 /len=1429 








385 
78_ 
at 


Cluster Incl. M63928:Homo sapiens T 
cell activation antigen mru>u-v, 
complete cds /cds=(1 00,882) 
/gb=M63928 /gi=1 80084 
/ug=Hs. 180841 /len=1204 


TNFR 
SF7 

\J l ■ 


tumor necrosis 
factor receptor 
superfamily, 
member 7 


12p13 


386 
53_ 
at 


Cluster Incl. D11428:Homo sapiens 
mRNA for PMP-22(PAS-ll/SR13/Gas- 
3), complete cds /cds=(1 88,670) 
/gb=D11428 /gi=220009 
/ug=Hs. 103724 /len=1806 


PMP2 

2 


peripheral 
myelin protein 
22 


17p12- 
p11.2 
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386 < 

66_ 

at 


Cluster Incl. M85169:Human 

\r\r\r*-\r\\r\ry\ ID rkf \/paQt S©C7 ITlRNA, 

complete cds /cds=(69,1265) 
/gb=M85169 /gi=338001 /ug=Hs.1050 
/len=3301 


PSCD | 
1 


pleckstrin 
homology, Sec7 
and coiled/coil 
domains 
1(cytohesin 1) 


17q25 




386 
95_ 
at 


Cluster Incl. AA203303:zx55b01.r1 
Homo sapiens cDNA, 5 end 
/clone=IMAGE-446377 /clone_end=5 
/nh-AAomw /ai=1799194 
/ug=Hs. 10758 /len=876 


NDUF 

S4 ! 


NADH 

dehydrogenase 
(ubiquinone) Fe- 
S protein 4 
(18kD) (NADH- 
coenzyme Q 
reductase) 


5q11.1 ' 


387 
05_ 
at 


Cluster Incl. AI310002:qo77c11.x1 
Homo Sapiens cuinm, o 
/clone=IMAGE-1 91 4548 /clone_end=3 
/gb=AI310002 /gi=4004873 
/ug=Hs. 108332 /len=656 


UBE2 
D2 


ubiquitin- 
conjugating 
enzyme E2D 2 
(homologous to 
yeast UBC4/5) 


5p14.2 
-q23.3 


387 
28_ 
at 


^i.,^- ini^i npfiQ7R'Hi]tnan mRNA for 
KIAA0225 gene, partial cds 
/cds=(0,6043) /gb=D86978 
/gi=1 504029 /ug=Hs.84790 /len=6237. 


KIAA0 
225 


KIAA0225 
protein 


7q33 


387 
30_ 
at 


Homo sapiens /REF=AB020671 

/ncc- C*\\ icfar Inrl mRNA for 

/Utr- Lriusier inw. • ||i,u,ri ,v 
KIAA0864 protein, partial cds 
7cds=(0,3656) /gb= /gi=4240216 
/ug=Hs.84883 /len=4319 /LEN=4481 


KIAA0 
864 


KIAA0864 
protein 


17p.11. 
2 




387 
35_ 
at 


Cluster Incl. AB011085:Homo sapiens 
mRNA for KIAA0513 protein, complete 
cds /cds=(631,1866) /gb=AB011085 
/gi=3043549 /ug=Hs.85053 /len=7758 


KIAA0 
513 


KIAA0513 gene 
product 


16q24. 
1 
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387 
47_ 
at 


Cluster inci. mo la^o.numan *->uo4 
gene, promoter and /cds=(258,1415) 
/gb=M81945 /gi=409018 /ug=Hs.85289 
/len=2616 








387 
67_ 
at 


— — 1 ■) 

Cluster Incl. AF041037:Homo sapiens 
novel antagonist ot rur signaling 
(sprouty-1) mRNA, partial cds 
/cds=(0,419) /gb=AF041037 
/gi=2827283 /ug=Hs.88044 /len=1586 


SPRY 

1 


sprouty 
(Drosophila) 
homolog 1 
(antagonist of 
FGF signaling) 


4 


387 
80_ 
at 


Cluster Incl. J04794:Human aldehyde 
reductase mKNA, compiexe ous 
/cds=(60,1037) /gb=J04794 
/gi=1 78480 /ug=Hs.89529 /len=1 1 32 


AKR1 
Al 


aldo-keto 
reductase family 
1, member A1 
(aldehyde 
reductase) 


1p33- 
p32 


387 
91_ 
at 


Cluster Incl. D29643:Human mRNA for 
KIAA0115 gene, complete cds 
/cds=(106,1476) /gD-u^yo^o 
/gi=473936 /ug=Hs.89674 /len=1668 


DDOS 
T 


dolichyl- 
diphosphooligos 
accharide- 
protein 

glycosyltransfer 
ase 


1p36.1 


388 
08_ 
at 


Cluster Incl. D64154:Human mRNA for 
Mr 110,000 antigen, complete ous 
/cds=(46,1269) /gb=D64154 
/gi=994759 /ug=Hs.90107 /len=1375 


GP110 


cell membrane 

alvcoorotein, 

110000M(r) 

(surface 

antigen) 


20q13. 
33 


388 
12_ 
at 


Cluster Incl. A/yboo.n. sapiens l^mvid^. 
mRNA for beta2 laminin 
/cds=(1 65,5561 ) /gb=X79683 
/gi=663206 /ug=Hs.90291 /len=5673 


LAMB 
2 


laminin, beta 2 

I W 1 1 III 111 » J 

(laminin S) 


3p21 


388 


Cluster Incl. AF039103:Homo sapiens 


HTATI 


HIV-1 Tat 


11p15. 



749 



BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



at 


Tot inlarortinn nrntoin TIP30 mRNA 

1 ai*-inie radii iy jjiuidii imow hiixi^#-a, 
complete cds /cds=(1 2,740) 
/qb=AF039103 /gi=3043926 
/ug=Hs.90753 /len=1297 


P2 


interactive 
protein 2, 30 
kDa 


1 


388 
33_ 
at 


r^ino+^r in^l YnfiA^7'l-liiman mRNA for 
LrlUSt©! inCI. AUUhQ / .nul Ileal I iiirxiNr-v 

SB classll histocompatibility antigen 
alpha-chain /cds=(0,702) /gb=X00457 
/qi=36405 /ug=Hs.914 /len=1048 


HLA- 
DPA1 


major 

histocompatibilit 
y complex, class 
II, DP alpha 1 


6p21.3 

1 

• 


388 
58_ 
at 


Cluster Incl. U04270:Human putative 
potassium channel subunit (h-erg) 
mKNA, complete cas /ca*— v. \ oo,ow£.f 
/gb=U04270 /gi=487737 
/uq=Hs.1 88021 /len=4070 


KCNH 
2 


potassium 
voltage-gated 
channel, 
subfamily H 
(eag-related), 
member 2 


7q35- 
q36 


388 
79_ 

at 


Cluster Incl. D83664:Human mRNA for 

UAArl ^caiCiUrn-DinCiii iy jjiuicim mi 

amniotic fluid 1), complete cds 
/cds=(68,346) /qb=D83664 
/gi=1 502286 /uq=Hs.19413 /len=466 


S100A 
12 


S100 calcium- 
binding protein 
A12 (calgranulin 
C) 


1q21 


388 
93_ 
at 


Cluster Incl. AL008637:Human DNA 
sequence from clone 833B7 on 
chromosome 22q1 2.3-1 3.2 Contains 
genes for NCF4 (P40PHOX) 
protein, cytokine receptor common beta 
cnain precursor vjor^r\D .^011.1017, 
ESTs, CA repeat, STS, GSS 
/Cds=(629,1648) /gb=AL008637 
/gi=3136 








388 
94_ 
q a 


Cluster Incl. AL008637:Human DNA 
sequence from clone 833B7 on 
chromosome 22q1 2.3-1 3.2 Contains 
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t ~n 


genes for NCF4 (P40PHOX) 
protein.cytokine receptor common beta 
chain precursor oorzr\D 
ESTs, CA repeat, STS, GSS 
/cds=(629, 1 648) /gb=AL008637 
/gi=3136 








388 

95_i 

_at 


Cluster inci. a/ /uy^-.n. sapiens mrMNn 
for p40phox /cds=(130,1149) 
/gb=X77094 /gi=458543 
/ug=Hs. 1 96352 /len=1 245 


NCF4 


neutrophil 
cytosolic factor 
4 (40kD) 


22q13. 
1 


389 
17_ 
at 


Cluster Incl. X7oo i /.n. sapiens mr\iN/A 
for T-cell receptor delta 
/cds=UNKNOWN /gb=X73617 
/gi=402624 /ug=Hs.2014 /len=2343 








389 
49_ 
at 


Cluster Incl. L01087:Human protein 
kinase C-theta (rKi\u i ; mr\iNA\, 
complete cds /cds=(94,2214) 
/gb=L01087 /gi=558098 
/ug=Hs.21 1 593 /len=2754 


PRKC 
Q 

Vac 


protein kinase 
C, theta 


10p15 


389 
63_i 
_at 


Cluster Incl. U12707:Human Wiskott- 
Aldrich syndrome protein h/vmoi-; 
mRNA, complete cds /cds=(34,1542) 
/gb=U12707 /gi=695150 /ug=Hs.2157 
/len=1806 


WAS 


Wiskott-Aldrich 
syndrome 
(eczema- 
thrombocytopen 

ia) 


Xp11.4 
p11.21 


389 
92_ 
at 


Cluster Incl. X64229:H. sapiens dek 
mRNA /cds=(33,1160) /gb=X64229 
/gi=30502 /ug=Hs.1 10713 /len=2699 


DEK 


DEK oncogene 
(DNA binding) 


6p23 


389 
94_ 
at 


Cluster Incl. AF037989:Homo sapiens 
STAT-induced STAT inhibitor-2 mRNA, 
complete cds /cds=(317,913] 


STATI 
2 


STAT induced 
STAT inhibitor-2 


12q 



751 



BNSDOCID: <WO 03039443A2J_> 



WO 03/039443 



PCT/EP02/12303 



at 1 


fgb=AF037989 /gi=3265032 
/ug=Hs.1 1 0776 /len=1 937 








61_ 
at 


nincter inri D28137Human mRNA for 
BST-2, complete cds /cds=(9,551) 
/gb=D28137 /gi=457563 
/ug=Hs.118110/len=996 


BST2 


bone marrow 
stromal cell 
antigen 2 


19p13. 
2 

• 


390 
62_ 
at 


Cluster Incl. AL008726:dJ337O18.2 
(Lysosomal Protective Protein 
rkro<-nr<ir»r (PC 3 4 16 5 Cathepsin A. 

PlCV-rUIOUl I L.V O . " . 1 W . W j VWH VV^wiii - ») 

Carboxypeptidase C)) /cds=(133,1575) 
/gb=AL008726 /gi=31 83870 
/ug=Hs. 1 1 81 26 /len=1 946 








390 
at 


Cluster Incl. U03057:Human actin 

KunHlinn nrntpin (HSN 1 ) mRNA, 

complete cds /cds=(1 11,1 592) 
/gb=U03057 /gi=458027 
/ug=Hs.1 18400 /len=2767 


SNL 


singed 

(Drosophila)-like 
(sea 1 urchin 
fascin homolog 
like) 


7p22 


3yu 
89_ 
at 


rMnetor inH Yn7fif)4 - H saoiens mRNA 
for nucleoside-diphosphate kinase 
/cds=(1 1,574) /gb=Y07604 
/gi=1 945761 /ug=Hs.9235 /len=879 


NME4 


non-metastatic 
cells 4, protein 
expressed in 


16p13. 
3 


391 
1 o_ 
at 


Cluster Incl. L08069: Human heat 

ohnr-lr nrnt^in F COli DnaJ tlOmOlOQUe 

mRNA, complete cds /cds=(82,1275) 
/gb=L08069 /gi=306713 /ug=Hs.94 
/len=1438 


DNAJ 
A1 


DnaJ (Hsp40) 
homolog, 
subfamily A, 
member 1 


9p13- 
p12 


391 
36_ 
at 


Cluster Incl. AB017642:Homo sapiens 
mRNA for oxidative-stress responsive 
1, complete cds /cds=(342,1925) 
/qb=AB017642 /gi=4519628 


0SR1 


oxidative-stress 
responsive 1 


3p22- 
p21.3 
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» 


/ug=Hs.95220 /len=4519 








391 
65_ 
at 


/-ni, .^f-.-. ir^^i i M7ini'Hnmiin NiflJ-likp 
OlUStSr inCI. \J*tf lUl.nuiiiaii muu nr\c 

protein (hNifU) mRNA, partial cds 
/cds=(0,366) /gb=U47101 /gi=1685101 
/ug=Hs.9908 /len=819 


NIFU 


nitrogen fixation 
cluster-like 


12q24. 
1 


391 
70_ 
at 


"Cluster Inch AL049957:Homo sapiens 

clone DKFZp564J0323) 
/cds=UNKNOWN /gb=AL049957 
/gi=4884209 /ug=Hs.99766 /len=2180" 








391 
79_ 
at 


Cluster Inci. Z26248:H.sapiens mRNA 
for eosinophil granule major basic 
protein /cds=(857,1525) /gb=Z26248 
/gi = 94Q5lU /ug— ns.yyyD^. /ien— 100/ 


PRG2 


proteoglycan 2, 
bone marrow 
(natural killer 
cell activator, 
eosinophil 
granule major 
basic protein) 


11q12 


392 
62_ 
at 


dUSlGT InCI. U / y^DO.nuruail uuuc 

23627 mRNA, complete cds 
/cds=(1 84, 1 041 ) /gb=U79266 
/gi=1710225 /ug=Hs.23642 /Ien=1561 


HSU79 
266 


protein 

predicted by 
clone 23627 


11q13. 
1 


393 
01_ 
at 


L/IUSier InCI. AoOUOU.n.oafJldlo Minim 

for skeletal muscle-specific calpain 
/cds=(0,2465) /gb=X85030 /gi=791039 
/ug=Hs.239689 /len=2466 


CAPN 

3 


calpain 3, (p94) 


15q15. 
1- 

q21.1 


393 
27_ 
at 


Cluster Incl. D86983:Human mRNA for 
KIAA0230 gene, partial cds 
/cds=(0,4490) /gb=D86983 
/gi=1 504039 /ug=Hs.1 1 8893 /len=551 0 


D2S44 
8 


Melanoma 
associated gene 


2pter- 
p25.1 
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393 
58_ 
at 


Cluster Incl. U37146:Human silencing 
mediator of retinoid and thyroid 
hnrmnnp action CSMRT) mRNA, 
complete cds /cds=(495,4982) 
/gb=U37146 /gi=1 045654 
/ug=Hs. 120980 /len=5970 






• 


one 

10_ 
at 


oinctiar inrl X"1fi665 - Human HOX2H 
mRNA from the Hox2 locus 
/cds=(78,1148) /gb=X16665 /gi=32381 
/ug=Hs.2733 /len=1520 


HOXB 
2 


homeo box B2 


17q21- 
q22 


396 
49_ 
at 


Cluster Incl. X78817:H.sapiens partial 
C1 mRNA /cds=(42,2882) /gb=X78817 
/gi=840785 /ug=Hs.3109 /len=3236 


ARHG 
AP4 


Rho GTPase 
activating 
protein 4 


Xq28 


396 
70_ 
at 


"Cluster Incl. AL050034:Homo sapiens 
mRNA; cDNA DKFZp566G0224 (from 

rtr\r\a nKF7n*^fifiG0224) /cdS=(0 1380) 

/gb=AL050034 /gi=4884274 
/ug=Hs.33573 /len=1762" 


ADPR 
TL3 


"ADP- 

ribosyltransferas 
e (NAD+; poly 
(ADP-ribose) 
poiymerase)-like 
3" 


3p22.2 
-p21.1 


396 
at 


Cluster Incl. AI362017:qy39a10.x1 

Unrnn canbnQ rDNA 3 SPICl 

/clone=IMAGE-201 4362 /clone_end=3 
/gb=AI362017 /gi=41 13638 
/ug=Hs. 135084 /len=778 


CST3 


cystatin C 
(amyloid 
angiopathy and 
cerebral 
hemorrhage) 


20p11. 
2 


oy / 
30_ 
at 


Piii"?tpr Incl X16416 - Human c-abl 
mRNA encoding p150 protein 
/cds=(1 47,3539) /gb=X16416 
/gi=28236 /ug=Hs. 146355 /len=5527 


ABL1 


v-abl Abelson 
murine leukemia 
viral oncogene 
homolog 1 


9q34.1 


397 
55 


Cluster Incl. Z93930: Human DNA 
sequence from clone 292E10 on 
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• 


chromosome 22q1 1-1 2. Contains the 
XBP1 gene for X-box binding protein 1 
nrREB51 ESTs STSs, GSSs and a 
putative CpG island /cds=(30,81 5) 
/gb=Z93930 /gi=4775603 
/ug=Hs.1 49923 /len=1 802 










397 
56_ 

9_a 
t 


Cluster Incl. Z93930:Human DNA 
sequence from clone 292E10 on 
chromosome 22q11-12. Contains the 
XBP1 gene for X-box binding protein 1 
rrRFR5^ ESTs STSs. GSSs and a 
putative CpG island /cds=(30.81 5 ) 
/gb=Z93930 /gi=4775603 
/ug=Hs.149923 /len=1802 








oy / 
75_ 
at 


niuct^r incl X54486'Human gene for 
C1 -inhibitor /cds=(60,1562) 
/gb=X54486 /gi=29534 /ug=Hs.151242 
/len=1827 








398 

U 1 

at 


Cluster Incl. AF046889:Homo sapiens 
IwqvI hvdroxvlase isoform 3 (PLOD3) 
mRNA, complete cds /cds=(2 16,2432) 
/gb=AF046889 /gi=31 53234 
/ug=Hs. 153357 /len=2735 


PLOD 

3 

i 


procollagen- 
lysine, 2- 
oxoglutarate 5- 
dioxygenase 3 


7q22 


398 
s_at 


Cluster Incl. AI052724:oz27a12.x1 
Hr>mr» «;aDiens cDNA, 3 end 
/clone=IMAGE-1 676542 /clone_end=3 
/gb=AI052724 /gi=3308715 
/ug=Hs. 109201 /len=682 


LOC51 
635 


CGI-86 protein 


14q23. 
1 




398 
24_ 
at 


Cluster Incl. AI391564:tg16b02.x1 
Homo sapiens cDNA, 3 end 
/clone-IMAGE-21 08907 /clone end=2 




ESTs, Weakly 
similar to 
A28996 proline- 
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/gb=AI391 564 /gi-^z i / ooo 
/ug=Hs. 110820 /len=442 




rich protein M14 
precursor 
mouse 
[M.musculus] 




398 
27_ 
at 


Cluster Incl. AA522530:ni38d12.s1 

■ ■ * — r*"v k i a Ann 

Homo sapiens cDNA, o ena 
/clone=IMAGE-979127 /clone_end=3 
/gb=AA522530 /gi=2263242 
/ug=Hs.111244/len=891 


FLJ20 
500 


hypothetical 
d rote in 


10pter- 
q26.12* 


398 
60_ 
at 


Cluster Incl. U05040:Human FUSE 
binding protein mRNA, complete cds 
/cds=(26,1960) /go-uuou^u 
/gi=460151 /ug=Hs.1 18962 /len=2325 




Homo sapiens 
far upstream 
element (FUSE) 
binding protein 
1 (FUBP1), 
mRNA 




399 
21_ 
at 


Cluster Incl. AI526089.UUo.^-/.nu/.r 
Homo sapiens cDNA, 5 end 
/clone_end=5 /gb=AI526089 
/gj=4440207 /ug=Hs.1342 /len=788 


HOX5 
B 


cytochrome c 
oxidase subunit 
Vb 


2cen- 
q13 


399 
29_ 
at 


Cluster Incl. AB02313y.nomo sapiens 
mRNA for KIAA0922 protein, partial 
cds /cds=(0,2372) /gb=AB023139 
/gi=4589475 /ug=Hs. 37892 /len=2505 


KIAA0 
922 


KIAA0922 
protein 


4q31.3 


399 
68_ 
at 


Cluster Incl. U501 36: Human 
leukotnene C4 syntnase ^li^^o; 
gene, complete cds /cds=(96,548) 
/gb=U50136 /gi=1314482 /ug=Hs.456 
/len=665 








399 
93 


Cluster Incl. D11466:Homo sapiens 
mRNA for PIG-A protein, complete cds 


PIGA 


phosphatidylino 
sitol glycan, 


Xp22.1 
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» 


/cds— (oo, i ooyj i\i u u||wu 
/gi=219993 /ug=Hs.51 /len=3589 




class A 
(paroxysmal 
nocturnal 
hemoglobinuria) 




400 
81_ 
at 


Cluster Incl. L26232:Human 
pnospnoiipio transier proiem ihimw, 
complete cds /cds=(87,1568) 
/gb=L26232 /gi=468325 
/ug=Hs.1 54854 /len= 1750 


PLTP 


phospholipid 
transfer protein 


20q12- 
q13.1 


401 
59_ 
r_at 


Cluster Incl. M55067:Human 47-kD 
autosomal chronic granulomatous 
disease protein hikna, compieie tu& 
/cds=(22, 1 1 94) /gb=M55067 
/gi= 189050 /ug=Hs.1583 /len=1349 


NCF1 


neutrophil 
cytosolic factor 
1 (47kD, chronic 
granulomatous 
disease, 
autosomal 1) 


7q11.2 
3 


401 
98_ 
at 


Cluster Incl. L06132:Human voltage- 
dependent anion cnannei isoiuim • 
(VDAC) mRNA, complete cds 
/cds=(99,950) /gb=L06132 /gi=340198 
/ug=Hs.1491 55 /len=1 806 


VDAC 


voltage- 
dependent 
anion channel 1 


5q31 


402 
81_ 
at 


Cluster Incl. D63878: Human mRNA for 
KIAAOloo gene, compiews ^ a 
/cds=(258,1343) /gb=D63878 
/gi=961447 /ug=Hs. 155595 /len=3433 


NEDD 

5 ' 


neural precursor 
cell expressed, 
developmental^ 
down-regulated 
5 


2q37 


402 
82_ 
s_at 


Cluster Incl. M84526:Human 
adipsin/complement factor D mRNA, 
complete cds /cds=(54,740) 
/gb=M84526 /gi=1 78625 
/ug=Hs. 155597 /len=1071 


DF 


D component of 

complement 

(adipsin) 


19p13. 
3 
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403 
96_ 
at 


r*ine?tiar in<->i I lAQ'^Q'VHiiman ionotroDic 
ATP receptor P2X5a mRNA, complete 
cds /cds=(52,1317) /gb=U49395 
/gi=1 552521 /ug=Hs.77807 /len=1956 


P2RX5 


purinergic 
receptor P2X, 
ligand-gated ion 
channel, 5 


17p13 


404 
07_ 
at 


Cluster Incl. U28386:Human nuclear 

lOC3liZ3tlOn setJUtJlloc 1 coc r LVI 

hSRP1 alpha mRNA, complete cds 
/cds=(1 32,1 721) /gb=U28386 
/gi=899538 /ug=Hs. 159557 /len=1976 


KPNA 

2 


karyopherin 
alpha 2 (RAG 
cohort 1, 
importin alpha 

D 


17q23. 
1- 

q23.3 


404 
19_ 
at 


UlUStBT inCI. AOOI I O.n.odpic;i 10 cpuf^. 

gene exon 1 /cds=(61 ,927) 
7gb=X85116 /gi=1161561 
/ug=Hs. 160483 /len=3035 








404 
56_ 
at 


"Cluster Incl. AL049963:Homo sapiens 

mRNA, CUNA L>i\r^.poo*tM i 

clone DKFZp564A132) 
/cds=UNKNOWN /gb=AL049963 
/gi=4884213 /ug=Hs. 16726 /len=1322" 








404 
93_ 
at 


r^i. ,«^r \r\r-\ i n^4.94'HLiman 

OlUStBl inCI. LUJ*t^.t.l lUinau 

hyaluronate receptor (CD44) gene 
/cds=(1 36,2211) /gb=L05424 
/gi=950415 /ug=Hs. 169610 /len=2905 








405 
09_ 
at 


Cluster Incl. J04058:Human electron 
transfer flavoprotein alpha-subunit 
mRNA, complete cds /cds=(0,1001) 
ifi/inco /ai=1 82250 

/ug=Hs.169919 /len=1266 


ETFA 


electron- 
transfer- 
flavoprotein, 
alpha 

polypeptide 
(glutaric aciduria 

II) 


15q23- 
q25 


405 


Cluster Incl. D25538:Human mRNA for 


ADCY 


1 adenylate 


16q12- 
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85_ 
at 


K1AA0037 gene, complete cds 
/cds=(265,3507) /gb=D25538 
/gi=436217 /ug=Hs.172199 /len=6196 


7 


cyclase 7 

t 


q13 


406 
10_ 
at 


Cluster Incl. AI743507:wf72a06.x2 
Homo sapiens cuina, o enu 
/clone=IMAGE-2361 1 06 /clone_end=3 
/gb=AI743507 /gi=51 1 1 795 
/ug=Hs.1 7351 8 /len=733 


ZFR 


zinc finger RNA 
binding protein 


5p13.3 


406 
98_ 
at 


Cluster Incl. X96719:H.sapiens mRNA 
for AICL (activation-induced C-type 
lectin) /cds=(1 32,581) /gb=X96719 
/gi=1 63281 5 /ug=Hs. 85201 /len=739 

i 


CLEC 
SF2 


C-type (calcium 
dependent, 
carbohydrate- 
recognition 
domain) lectin, 
suoerfamilv 
member 2 
(activation- 
induced) 


12p13- 
P 12 


407 
18_ 
at 


Cluster Incl. ArUlool i .noma sapiens 
lymphopain mRNA, complete cds 
/cds=(0, 1 1 30) /gb=AF01 361 1 
/gi=2582044 /ug=Hs.87450 /len=1131 


CTSW 


cathepsin W 
(lymphopain) 


11q13. 
1 


407 
23_ 
at 


Cluster Incl. AJuiuuoy.nomo sapiens 
SIT protein /cds=(87,677) 
/gb=AJ010059 /gi=4688891 
/ug=Hs.88012 /len=1232 


SIT 


SHP2 
interacting 
transmembrane 
adaptor 


9p13- 
P12 


407 
63_ 
at 


Cluster Incl. U85707:Human 
leukemogenic homolog protein 
(MEIS1) mRNA, complete cds 
/cds=(65, 1 237) /gb=U85707 
/gi=2058550 /ug=Hs.170177 /len=2511 


MEIS1 


Meisl (mouse) 
homolog 


2p14- 
p13 
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407 

D 1 _ 

at 


Cluster Incl. M59499:Human 

iir-\/^rkrr*fciin-aQ^nr i iptpd coaoulation 

inhibitor (LACI) gene /cds=(2,916) 
/gb=M59499 /gi=1 87205 
/ug=Hs. 1 70279 /len=3599 








407 
75_ 
at 


Cluster Incl. AL021786:Human DNA 
sequence from PAC 696H22 on 
chromosome Xq21.1-21.2. Contains a 
mnMCD cok liUp npne a Kinesin like 
pseudogene and ESTs /cds=(0,680) 
/gb=AL021786 /gi=2853186 
/ug=Hs.17109/len=1389 






• 


407 
86_ 
at 


Cluster Incl. U37352: Human protein 
^hncnhntncQ 9A Ralnhal reaulatorv 

pnOSprialoot? £-r\ Daipiia ' icyuioiwijf 

subunit mRNA, complete cds 
/cds=(88,1632) /gb=U37352 
/gi=1 20381 1 /ug=Hs.171734 /len=4064- 


PPP2 
R5C 


protein 

phosphatase 2, 
regulatory 
subunit B (B56), 
gamma isoform 


3p21 


408 

A ~7 

1 /_ 

at 


Cluster Incl. M96824:Human 
r-w \ r>\ar\V\\r\c\\r\ nr^onr^or mRNA. 

nUCiGODinQli i pi cou i ov-M mi 

complete cds /cds=(39,1421) 
/gb=M96824 /gi=1 89307 
/ug=Hs. 1 72609 /len=1 650 


NUCB 
1 


nucleobindin 1 


19q13. 
2- 

q13.4 


408 
at 


Cluster Incl. U29953:Human pigment 

Ar^iih^lii im H»ri\/<=rl factor Q6I16. 

epiineiiun i-utsi i vcu ' aoiui y ,w > 
complete cds /cds=(136,1224) 
/gb=U29953 /gi=1 144298 
/ug=Hs. 1 73594 /len=1 51 1 








408 
64_ 
at 


Cluster Incl. D25274:Homo sapiens 
mRNA, clone-P02ST9 
/cds=UNKNOWN /gb=D25274 
/gi-464185 /ug=Hs. 173737 /len=1232 
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/ g j=464185 /ug=Hs. 173737 /len=1232 








408 

65 

at 


Cluster Incl. U51166:Human G/T 
mismatch-sDecific thymine DNA 
glycosylase mRNA, complete cds 
/cds=<399,163'1) /gb=U51166 
/gi=1378106 /ug=Hs. 173824 /len=3410 


TDG 


thymine-DNA 
glycosylase 


12q24. 
1 


409 

36 

at 


Cluster Incl. AI651806:wb55f10.x1 
Homo saDiens cDNA. 3 end 
/cione=IMAGEr230961 1 /clone_end=3 
/gb=AI651806 /gi=4735797 
/ug=Hs. 19280 /len=609 


CRIM1 


cysteine-rich 
motor neuron 1 


2p21 


410 
at 


Cluster Incl. AI126134:qd77c05.x1 

Hnmn ^flnipns cDNA 3 ©Pld 

/clone=IMAGE-1 735496 /clone_end=3 
/gb=AI126134 ' /gi=3594648 
/ug=Hs. 1 00000 /len=446 


S100A 
8 


S100 calcium- 
binding protein 
A8 (calgranulin 
A) 


1q21 


411 

38_ 
at 


Cluster Incl. M16279:Human MIC2 
mRNA, complete cds /cds=(1 77,734) 
/ab=M16279 /qi=1 88542 
/ug=Hs.1 77543 /len=1238 


MIC2 

• 


antigen 

identified by 
monoclonal 
antibodies 
12E7, F21 and 
013 


Xp22.3 
2 


411 
53_ 
f_at 


Cluster Incl. AF 102803: untitled 
/cds=(2,2722) /gb=AF1 02803 
/gi=4092760 /ug=Hs. 178452 /len=3668 








411 
55_ 
at 


Cluster Incl. U03100:Human 
alpha2(E)-catenin mRNA, complete 
cds /cds=(4,2799) /gb=U03100 
/gi=414981 7ug=Hs.178452 /len=3526 


CTNN 
A1 


catenin 
(cadherin- 
associated 
protein), alpha 1 
(102kD) 


5q31 
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(102kD) 




411 
56_ 

9_a 
t 


Cluster Incl. U031 00: Human 
alpha2(E)-catenin mRNA, complete 
cds /cds=(4,2799) /gb=U03100 
/gi=414981 /ug=Hs. 178452 /len=3526 


CTNN 
A1 


catenin 
(cadherin- 
associated 
protein), alpha 1 
(102kD) 


5q31 

• 


411 
63_ 
at 


Cluster Incl. AL109672:Homo sapiens 
mRNA full length insert cDNA clone 
EUROIMAGE 755868 /cds=(98,751) 
/gb=AL1 09672 /gi=5689836 
/ug=Hs.179516 /len=1378 


P24B 


integral type I 
protein 


15q24- 
q25 


41 1 
64_ 
at 


Cluster Incl. X67301:H. sapiens mRNA 
for IgM heavy chain constant region 
(Ab63) /cds={0,1361) /gb=X67301 
/gi=38407 /ug=Hs.1 79543 /len=1453 


IGHM 


immunoglobulin 
heavy constant 
mu 


14q32. 
33 


411 
65_ 

9_a 
t 


Cluster Incl. X67301:H.sapiens mRNA 
for IgM heavy chain constant region 
(Ab63) /cds=(0,1361) /gb=X67301 
/gi=38407 /ug=Hs.1 79543 /len=1453 

» 


IGHM 


immunoglobulin 
heavy constant 
mu 


14q32. 
33 


411 
66_ 
at 


Cluster Incl. X58529:Human 
rearranged immunoglobulin mRNA for 
mu heavy chain enhancer and 
constant region /cds=UNKNOWN 
/gb=X58529 /gi=33480 /ug=Hs.1 79543 
/len=2325 








411 
77_ 
at 


Cluster Incl. AW024285:wt69d06.x1 
Homo sapiens cDNA, 3 end 
/clone=IMAGE-251 271 5 /clone_end=3 
/gb=AW024285 /gi=5877815 


FLJ12 
443 


hypothetical 
protein 
FLJ 12443 


5p15.3 
3 
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1 


/ug=Hs. 1 79882 /len=550 




i 




A A i 

91_ 
at 


II 101. /ADLl£.0^-VJJ7.rii-/i i iv oa|jici io 

mRNA for KIAA0992 protein, partial 
cds /cds=(0,2318) /gb=AB023209 
/gi=4589627 Aig=Hs. 180347 /len=4347 


KIAAO 
992 


palladin 


4q32.3 


AAA 
41 1 

93_ 
at 


O1USI61 inCI. r\DVJ I OOO4C. nui I t\J ocajjici 10 

mRNA for DUSP6, complete cds 
/cds=(351,1496) /gb=AB013382 
/gi=3869139 /ug=Hs. 180383 /len=2390 


DUSP 
6 


dual specificity 
phosphatase 6 


12q22- 
q23 


412 

oo_ 

at 


Cluster Incl. Z22555:H.sapiens 
encooing l»la- i mKiNM /cas— ^o», ia»o; 
/gb=Z22555 /gi=397606 
/ug=Hs.1 80616 /len=2552 


CD36L 

1 


CD36 antigen 
fcollaaen tvoe 1 
receptor, 
thrombospondin 
receptor)-like 1 


12q24. 
31 


412 
20_ 
at 


oiuster inci. ADUZozuo.norno bdj-nciio 
mRNA for KIAA0991 protein, complete 
cds /cds=(732,2000) /gb=AB023208 
/gi=4589625 /ug=Hs.181002 7len=3938 


MSF 

ivIO 1 


MLL seotin-like 

| V ■ L> w \* *«/ bill III x w 

fusion 


17q25 


412 
73_ 
at 


Cluster Incl. 
AL046940:DKFZp586l0517_r1 Homo 
sapiens cljin/a, o cmu 
/clone=DKFZp586l051 7 /clone_end=5 
/gb=AL046940 /gi=5434999 
/ug=Hs.231657 /len=695 


• 






413 
38_ 
at 


Cluster Incl. AI951946:wx39f10.x1 
Homo sapiens cDNA, 3 end 
/clone=IMAGE-2546059 /clone_end=3 
/gb=AI951946 /gi=5744256 
/ug=Hs.244 /len=523 
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96_ 
at 


Philter Incl AB006629"Homo sapiens 
mRNA for KIAA0291 gene, partial cds 
/cds=(0,2856) /gb=AB006629 
/gi=2564329 /ug=Hs.104717 /len=4943 


CYLN2 


cytoplasmic 
linker 2 


7q11.2 
3 


70_ 
at 


r.ln<;tpr Incl AF027208'Homo sapiens 
AC133 antigen mRNA, complete cds 
/cds=(37,2634) /gb=AF027208 
/gi=2688948 /ug=Hs.1 12360 /len=3794 


PROM 
L1 


prominin 
(mouse)-like 1 


4p15.3 
3 


414 
at 


Cluster Incl. W72424:zd66a09.s1 

Unmn <i;anie*n^ cDNA 3 end 

Mwl 1 lv OupiCI IO vl— ' 1 j v/ wiiw 

/clone=IMAGE-345592 /clone_end=3 
/gb=W72424 /gi=1 382379 
/ug=Hs.1 1 2405 /len=604 


S100A 
9 


S100 calcium- 
binding protein 
A9 (calgranulin 
B) 


1q21 


03_ 
at 


Pinestpr Inrl AB020661 "Homo sapiens 
mRNA for KIAA0854 protein, complete 
cds /cds=(304,2817) /gb=AB020661 
/gi=42401 96 7ug=Hs.30209 /len=4089 


KIAA0 
854 


KIAA0854 
protein ' 


8q24.1 
3 


415 

GO 

at 


Cluster Incl. AF006484:Homo sapiens 

nntati\/p nral tumor ^unnressor DTOtein 

IJJJLClllVw Vl Ql IUI 1 Ivl v Upwl ww wwi |«/iwkwti» 

(doc-1) mRNA, complete cds 
/cds=(522,869) /gb=AF006484 
/gi=2738496 /ug=Hs.3436 /len=1608 


CDK2 
AP1 


CDK2- 
associated 
protein 1 


12q24. 
31 


415 
47 

at 


Cluster Incl. AF047472:Homo sapiens 
enlppn mitotic checkpoint BUB3 

OUIwd 1 III llwllw vi i w w» xw^ w » 1 • v 

(BUB3) mRNA, complete cds 
/cds=(70, 1 056) /gb=AF047472 
/gi=2921872 /ug=Hs.40323 /len=2585 


BUB3 


BUB3 (budding 
uninhibited by 
benzimidazoles 
3, yeast) 
homolog 


10q26 


416 
09_ 
at 


Cluster Incl. U15085:Human HLA-DMB 
mRNA, complete cds /cds=(233,1024) 
/gb=U15085 /gi=557701 /ug=Hs.1162 


HLA- 
DMB 


major 

histocompatibilit 
y complex, class 


6p21.3 
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at 


/len=1362 




II, DM beta 




416 
54_ 
at 


Cluster Incl. X02994:Human mRNA for 
adenosine deaminase ^aaenosnie 
aminohydrolase, EC 3.5.4.4) 
/cds=(95,1186) /gb=X02994 /gi=28379 
/ug=Hs.1217/!en=1498 


ADA 


adenosine 
deaminase 

\J W vll 1 III iL* WV 


20q12- 
q13.11 


416 
60_ 
at 


Cluster Incl. AL031588:dJ1163J1.1 
(ortholog of mouse transmembrane 
receptor Celsrl (KIAA0279 LIKE EGF- 
like domain containing protein similar 
to rat MEG /cds=(0,4433) 
/gb=AL031 588 /gi=40071 08 
/ug=Hs. 123043 /len=6438 


GTSE 
1 


G-2 and S- 
phase 

expressed 1 


22q13. 
2- 

q13.3 


416 
94_ 
at 


Cluster Incl. Ml7/54.riuman dinoi 
mRNA, complete cds /cds=(51,1238) 
/gb=M17754 /gi=179512 /ug=Hs.1276 
/len=1881 


DINO 1 1 


BN51 (BHK21) 
temperature 
sensitivity 
complementing 


8q21 


417 
23_ 
s_at 


Cluster Incl. M32578:Human MHC 
class II MLA-UK oeia- 1 mr\iN/\ 
(DR2.3), 5end /cds=(61 ,861) 
/gb=M32578 /gi=1 88305 
/ug=Hs. 1 81 366 /len=1 21 6 


HLA- 
DRR1 

Ul\U 1 

t 


major 

histocomDatibilit 
y complex, class 
II, DR beta 1 


6p21.3 


417 
34_ 
at 


OlUSter incl. AdU^ud/ /.riuiiiu bdpicno 

mRNA for KIAA0870 protein, complete 
cds /cds=(436,1998) /gb=AB020677 
/ g i=4240228 /ug=Hs.18166 /len=4484 


KIAA0 
870 


KIAA0870 
protein 


8q24.3 


417 
47_ 
s_at 


Cluster Incl. U49020:Human myocyte- 
specific enhancer factor 2A (MEF2A) 
gene, first coding /cds=(142,1662) 
/gb=U49020 /gi= 1 1 97536 
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/ug=Hs.1 82280 /len=5329 








417 

CO 

9_a 

t 


Cluster Incl. D64015:Homo sapiens 
m DMA for T-^inQt^r hindina orotein 
complete cds /cds=(1 57,954) 
/gb=D64015 /gi=2281005 
/ug=Hs. 182741 /len=1737 


T1AL1 


TIA1 cytotoxic 
granule- 
associated 
RNA-binding 
protein-like 1 


10q 

• 


417 
96_ 
at 


r^inofiar inH ARH9Q01 ^Hnmn ^aniens 
mRNA for KIAA1092 protein, partial 
cds /cds=(0,3464) /gb=AB029015 
/gi=5689520 /ug=Hs.54886 /len=4147 


PLCE2 


phospholipase 
C, epsilon 2 


3p24.3 


A A O 

418 
08_ 
at 


r^ii letar in#-*i A Pn^9 1 09" Hfimf) ^aniens 
clone 23926 mRNA sequence 
/cds=UNKNOWN /gb=AF052102 
/gi=3360409 /ug=Hs.5671 /len=1884 








418 
09_ 

at 


Cluster Incl. AI656421:tt50h10.x1 

U/smn conipnc pHNA 3 GPld 

/clone=IMAGE-2244259 /clone_end=3 
/gb=AI656421 /gi=4740400 
/ug=Hs.5671 /len=566 


MGC4 
175 


hypothetical 

protein 

MGC4175 


7q21.1 
-q21.2 


418 

VI "7 

47_ 

at 


Cluster Incl. AA214546:zr92c03.s1 

Ur>mn canbnQ pDNA 3 end 

/clone=IMAGE-683140 /clone_end=3 
/gb=AA214546 /gi=1813171 
/ug=Hs.66576 /len=516 


IL24 


interleukin 24 


1q32 


432 
_s_ 
at 


X02883 /FEATURE=cds 
/DEFINITION=HSTCRAC Human gene 
for T-cell receptor alpha chain C region 


TRA 


T cell receptor 
alpha locus 


14q11. 
2 


484 
at 


U59302 /FEATURE^ 
/DEFINITION=HSU59302 Human 


NCOA 
1 


nuclear receptor 
coactivator 1 


2p23 
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_at 


steroid receptor coactivator-1 F-SRC-1 
mRNA, complete cds 


1 


coactivator 1 




i— /-* f\ 

529 
_at 


U15932 /rcrtiur\t 
/DEFINITlON=HSU15932 Human 
dual-specificity protein phosphatase 
mRNA, complete cds 


DUSP 
5 


dual specificity 
phosphatase 5 


10q25 


538 
_at 


S53911 /FEATURE 3 
/DEFINITION=S53911 
CD34=glycoprotein expresseu m 
lymphohematopoietic progenitor cells 
{alternatively spliced, truncated form} 
[human, UT7, mRNA, 2657 nt] 


CD34 


CD34 antigen 


1q32 


585 
_at 


M30938 /FEATURE=mRNA#2 
/DEFINITION=HUMKUP Human Ku 
(p70/p80) subunit mRNA, complete 
cds 


XRCC 
5 


"X-ray repair 
complementing 
defective repair 
in Chinese 
hamster cells 5 
(double-strand- 
break rejoining; 
Ku autoantigen, 
80kD) M 


2q35 


605 
_at 


L78833 /FEATURE=exon#36 

ir.i-1-iMi-ripiM — uji iftflDDP A 1 Human 

BRCA1, Rho7 and vatl genes, 
complete cds, and ipf35 gene, partial 
cds 








706 
_at 


Glucocorticoid Receptor, Beta 








767 
at 


AF001548 /FEATU RE= m RN A 
/DEFINITION=HUAF001548 Human 
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_at 


Chromosome 16 BAC clone 
CIT987SK-A-81 5A9, complete 
sequence 








820 
_at 


U77604 , /FEATURE= 

/rMTiriMITIOM— M^l I77R04 Homo 

sapiens microsomal glutathione S- 
transferase 2 (MGST2) mRNA, 
complete cds 


MGST 
2 


microsomal 
glutathione S- 
transferase 2 


4q28- 
q31 

• 


854 
_at 


C7C o i7 /FEATURE= 
/DEFINITION=S76617 blk=protein 
tyrosine kinase [human, B 
lymphocytes, mRNA, 2608 nt] 


BLK 


B lymphoid 
tyrosine kinase 


8p23- 
p22 


931 
_at 


L08177 /FEATURE= 
/DEFINITION=HUMGPCRB Human 
EBV induced G-protein coupled 
receptor (tDiz; mrviN/A, uuinjjieic wuo 


EBI2 


Epstein-Barr 
virus induced 
gene 2 
(lymphocyte- 
specific G 
protein-coupled 
receptor) 


13q32. 
3 


932 

_i_a 

t 


mC7 o /FEATURE^ 
/DEFINITION=HUMKRUPZN Human 
Kruppel related zinc finger protein 
(HTF10) mRNA, complete cds 


ZNF91 


zinc finger 
protein 91 
(HPF7, HTF10) 


19p13. 
1-p12 


AAA 

933 
_f_ 
at 


mc7 o /FEATURE= 
/DEFINITION=HUMKRUPZN Human 
Kruppel related zinc finger protein 
(HTF10) mRNA, complete cds 


ZNF91 


zinc finger 
protein 91 
(HPF7, HTF10) 


19p13. 
1-p12 


958 
s 


Rna Polymerase II, 14.5 Kda Subunit 
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at 






• 




t 

AFF 

X- 

HU 

MR 

GE/ 

M1 

009 

8_ 

M_ 

at 


M10098 Human 18S rRNA gene, 
complete (_5, _M, _3 represent 
transcript regions 5 prime, Middle, and 
3 prime respectively) 
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Table 22: 



Pairyvise Comparisons 



1 

s 

■5* 



r — \ 

i j 


nBiyi I CLL \ CML 


'".ALL | AML I 




nBM{ 


J K1A0952.. 3NL j 1RF4. HLA-DMB 


>iSCRt;KCNH2 j 
K1AA0462* 


PuscRi;cAMp : 


n=8 


ICLL 


' *V • : 

1 00% (1.00) i TRB.HIA-OMB 


NOCA1 U923B1* j 


PO02AF1, . 
TNFRSF7 


n=8; 


CML 


i00% (-1.00i 100% (1-00) • -i } 


I CLC,tAtDpr i 


DEFA3, LCN2; 
SOP23*- 


n=10 


ALL 


100% (0.97) i 00% (1.00) 1GO%.(1.00) ^ : < 


OS-9YLEF1 1 MSF 
PPGB.APUP2: 


n=18 


AML 


100% (1.0oi 100% (T.OOj 100% : (0.96) 97% (0:95) j 


... 


\ n=59 




n=§ n=8 n=10 


i n=18 












\ % - accuracy in leave-one-out cross validation 






r"Q' = confidence: ! 


I i 




i 
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Table 23: 
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Golub 


diffgenes 






A.- Samples: 18/85 


A- samples: 18/85 


• 


Accuracy 0,87 


accuracy 0,96 


Confidenc 

e 0,77 


confidenc 

e 0,88 


Failed 6,19,22,26,78,79,80,81,82,83,84,85,99 


failed 5,6,19,22 






Gene 


Signal-to- 
noise 


p 


decision 
limit 


gene 


signal-to- 
noise 

i 


p 


decision 
limit 


gl 


-1,14 


0* 


482,01 


gl 


-1,14 


0 




g2 


-1,06 


0* 


192,17 


g2 


-1,06 


0* 


98,50 


g3 


-0,97 


0* 


207,67 


g3 


-0,97 


0 




g4 


0,94 


0* 


205,05 


g4 


0,94 


0 




g5 


-0,93 


0* 


1818,11 


g5 


-0,93 


0 




g6 


0,93 


0* 


451,74 


g6 


0,93 


0 




g7 


-0,91 


0* 


23,84 


g7 


-0,91 


0 




g8 


-0,90 


0* 


225,72 


g8 


-0,90 


0 





772 

BNSDOC1D: <WO 03039443A2_1_> 



WO 03/039443 



PCT/EP02/12303 



gio 



fell 



512 



U3 



g!4 



gl5 



gl6 



U7 



gl8 



gl9 



g20 



0,90 



0,89 



0,88 



,87 



,87 



,86 



,85 



-0,85 



0,84 



-0,84 



-0,83 



0,46 



0* 



0* 



0* 



0* 



0* 



0" 



0* 



0* 



0* 



0* 



0" 



43,85 



210,78 



55,39 



18,63 



127,15 



222,04 



68,52 



546,97 



1242,77 



162,61 



385,39 



105,38 



g9 0,90 



gio 



gll 



gl2 



213 



gl5 



gl6 



gl7 



gl8 



gl9 



g20 



0,89 



0,88 



0,87 



0,87 



gl4 0,86 



0,85 



0,84 



-0,84 



0,85 



-0,83 



0,46 



0 



0 



0 



0< 



67,80 



0* 



0 



0 



0 



0 



0 



0 



64,10 
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t(15;17) 

vs. 
remainder 


D I 

- C 


=> i 


c 
c 


— < 

■vi >< 
■ 


^ < 


i 

i 






t(8;21) 

vs. 
rpmainder 


3 < 
O < 

< 


30 I 

d j 










CO 

lO | 

oi 
i 


Table 24: 

— . — r 


inv(16) 

vs. 
r6rnairicit?i 


1.00 


0.95 


-1.52 










inv(16) 

vs. 
t(8;21) 


1.00 


0.93 






i 






t(15;17) 

vs. 
inv(16) 


1.00 


0.96 












t(15;17) 

vs. 
t(8;21) 


1.00 


0.91 










2.85 


Classes 


Accuracy 


Prediction strength 


M65066 


AL049933 


AF010310 


N90866 


M26326 


|_w . 
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< 
< 


r> 


< 
< 


» 
z> 

ri 


< 
< 

< 


■o 

rsi 


1 

< 

< 


n 

>i 
i 






















6.99 




6.56 








8.43 




7.78 




6.56 












-6.84 


3.08 
















3.08 






2.68 




N99340 


M25915 


P(g,c) AF013570 


AI207842 


X16665 


X96719 


AF013611 


W72424 
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Table 25 



A 

Gen Bank 

• I 
accession 

C 

No. 

£ 


approved 
JCUHGNC/HUG 

C 

} database 
,ymbol 


Description 


identified 
according to 
Golub et ai. 


identified 
utilizing 
multiple-tree 
classifiers 


M65066 / 


3RKAR1B c 


sAMP-dependent protein Kinase reguiaiory s»uuuiui m u^i« 


X 




AL049933 < 


< 

3NAI1 « 


guanine nucieotiae Dinaing proiein pruiom;, aif-ma iini«*i»my 
activity polypeptide 1 


X 




AF010310 


P/G6* 


proline oxidase homolog 


X 




N90866 


CDW52 


CDW52 antigen (CAM PATH- 1 antigen) 


X 




M26326 


KRT18 


keratin, type i cytoskeletal 18 


x 


X 


N99340 


DKFZP586N1922 


DKFZP586N1922 protein 


X 


X 


M25915 


CLU 


clusterin precursor 


X 




AI207842 


PTGDS 


prostaglandin-!^ d-isomerase precursor 


X 




X16665 


HOXB2 


homeobox protein hox-b2 


X 


X 


X96719 


CLECSF2 


C-type (calcium dependent, caroanyardi©-f©i*uyimiuM uuinmn 
lectin, superfamiiy member 2 (activation-induced) 


) 

f 

X 


X 


AF013611 


CTSW 


cathepsin w (lymphopain) precursor 


X 


X 


W72424 


S100A9 


calgranulin b (migration inhibitory factor-related protein 14) 


X 




AF01357O 


MYH11 


myosin heavy chain, smooth muscle isoform 


X 


X 


AF001548 


MYH11 


myosin heavy chain, smooth muscle isoform 




X 



777 



<WO 03039443A2 I _> 



WO 03/039443 



PCT/EP02/12303 



X53742 F 


BLN1 Ti 


bulin-1 




X 


U37122 A 


DD3 g 


amma adducin 




X 


J03853 /> 


\DRA2C a 


lpha-2c-1 adrenergic receptor 




X 


Y10183 / 


C 

\LCAM r 


:D166 antigen precursor (activated leukocyte-cell adhesion 
nolecule) 




X . 


AB002313 / 


=>LXNB2 I 


alexin B2 




X 


X78817 


<KRHGAP4 


-ho GTPase activating protein 4 




X 


X54486 


SERPING1 


plasma protease d inhibitor precursor 




X 


L19872 


AHR 


aryl hydrocarbon receptor 




X 


M15395 


ITGB2 


CD18, integrin beta-2 precursor 




X 


AF045229 


RGS10 


regulator of g-protetn signaling 1 0 




X 


D43638 


CBFA2T1 


MTG8 protein (ETO protein) 




X 


M25280 


SELL 


l-selectin precursor (lymph node homing receptor) 




X 


W25986 


DKFZP564K082Z 
* 


I 

hypothetical protein DKFZp564K0822 




X 


M36035 


BZRP 


peripheral-type benzodiazepine receptor 




X 


X64624 


POU4F1 


brain-specific homeobox/pou domain protein 3a 




X 


M18728 


CEACAM6 


carcinoembryonic antigen-related cell adhesion molecule 6 (nor 
specific cross reacting antigen) 


i- 


X 


M77349 


TGFBI 


transforming growth factor-beta induced protein ig-h3 precursor 




X 
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M80899 / 


<\HNAK 


neuroblast differentiation associated protein ahnak 




X 


» 

M 13560 


CD74 


CD74 antigen, (invariant polypeptide of major histocompatibility 
complex, class II antigen-associated) 




X 


X62744 


HLA-DMA 


major histocompatibility complex, class ll, DM alpha, RING6 

» 




X 


M32578 


HLA-DRB1 


HLA class II histocompatibility antigen, dr-1(dw14) beta chain 
precursor 




X 


X00457 


HLA-DPA1 


HLA class II histocompatibility antigen, dp alpha chain precursor 




X 


J00194 


HLA-DRA 


HLA class II histocompatibility antigen, dr alpha chain precursor 




X 
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Table 26: Comparisons of protein expression and mRNA abundance in 
leukemia as assessed by flow cytometry and microarray analysis 



Antigen 


Number of ! 
comparisons 


□Ath PP and MA 
no^itive 


Both FC and MA 
negative 


MA positive and 
FC negative 


FC positive and 
MA negative 


Myeloperoxidase 


OK 

zo 


25 






- 


CD13 




24 






1 


CD33 


ZD 


24 






1 


CD45 


0*1 


21 




_ 


• 


HLA-DR 


10 


10 






- 


CD135 


4 






1 


- 


CD61 






19 


_ 


- 


CD10 


A C 

15 




15 






CD235a 


14 




12 


2 


- 


NG2 


A A 

1 1 




11 


— : 


_ 


CD22 


5 




5 


: 




CD133 


4 




4 




_ 


CD79a 


2 




1 


1 


- 


CD14 


23 


c 

w 


14 


3 


_ 


CD34 


22 


17 


4 


1 


_ 


CD2 


22 


lw 


3 


3 


- 


CD7 


22 




8 




_ 


CD15 


19 


15 


1 


3 


- 


CD3 


9 


4 


4 


1 


_ 


Lactoferrin 


13 


q 


2 


3 


- 


CD116 


c 
D 




2 




_ 


CD11b 


-7 
/ 


5 


2 


: 


- 


CD19 


A O 

12 


5 


2 


5 


_ 


CD36 


22 


2 


14 


- 


6 


LrUOO 


9 


4 


1 




4 


CD4 


18 


6 


7 


2 


3 


CD56 


23 


2 


17 




4 


CD64 


23 


15 


4 


3 


1 


TdT 


20 




17 


2 


1 


Total 


450 (100%) 


230(51.1%) j 169(37.6%) 
399 (88.7%) congruent 


30 (6.7%) 
51 (11.4%) i 


21 (4.7%) 
not congruent 



Protein expression and mRNA abundance were compared in 25 patients. "Number of 
5 comparisons" indicates the number of patients analyzed for the respective antigens 
(maximum number, 25 patients) 
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Table 27 



Affymet 
rix ID 



Description_microarray 



Symbol 



Descriptio Chromosom 
n NetAffx e 



1087_a 
t 



M60459 /FEATURE= 
/DEFINITION=HUMERYTH Human 
erythropoietin receptor mRNA, complete cds 



AD000092 /FEATURE=cds#2 
/DEFINITION=CH19HHR23 Homo sapiens 
1 747_a DNA from chromosome 1 9p1 3.2 cosmids 
t R31240, R30272 and R28549 containing the 
EKLF, GCDH, CRTC, and RAD23A genes, 
genomic sequence 

AD000092 /FEATU RE=cds#6 
/DEFINITION=CH19HHR23 Homo sapiens 
1 752_a DNA from chromosome 1 9p1 3.2 cosmids 
t R31240, R30272 and R28549 containing the 
EKLF, GCDH, CRTC, and RAD23A genes, 
genomic sequence 

S82470 /FEATU RE= /DEFINITION=S82470 
BB1=malignant cell expression-enhanced 
180_at gene/tumor progression-enhanced gene 
[human, UM-UC-9 bladder carcinoma cell 
line, mRNA, 1897 nt] 



206 at 



M84424 /FEATURE=expanded_cds 
/DEFINITION=HUMCTSE09 Human 
cathepsin E (CTSE) gene, exon 9 and 



erythropoietin 
receptor 



19p13.3- 
p13.2 




leukocyte 
I receptor cluster 
j (LRC) member 
4 



19q13.4 



cathepsin E 



781 



WO 03/039443 



PCT/EP02/12303 



31381. 
at 



31682_ 
s at 



31749. 
f at 



32323. 
at 



complete cds 

Cluster Incl. AF076483:Homo sapiens 
peptidoglycan recognition protein precursor 
(PGRP) mRNA, complete cds /cds=(44,634) 
/gb=AF076483 /gi=3342532 /ug=Hs. 137583 
/len=690 

Cluster Incl. D32039:Human pgH3 mRNA for 
proteoglycan PG-M(V3), complete cds 
/cds=(1 05,2072) /gb=D32039 /gi=1008912 
/ug=Hs.234753 /len=2087 

Cluster Incl. Z98744:histone H2A 
/cds=(7,399) /gb=Z98744 /gi=3080457 
/ug=Hs. 1 31 954 /len=499 

Cluster Incl. M63582:Human 
preprothyrotropin-releasing hormone gene 
/cds=(8,736) /gb=M63582 /gi=1 90297 
/ug=Hs.1 82231 /len=1457 



peptidoglycan 
recognition 
protein 



chondral tin 
sulfate 
proteoglycan 2 
(versican) 



19q13.2- 
q13.3 



5q14.3 




H2A histone 
family, member 6p22-p21.3 

N 



33093. 
at 



33584. 
at 



34110. 
g_at 



Cluster Incl. AF077346:Homo sapiens 
interleukin-18 receptor accessory protein-like 

mRNA, complete cds /cds=(483,2282) 
/gb=AF077346 /gi=3851059 /ug=Hs.158315 
/len=2681 

Cluster Incl. U35146:Human p56 KKIAMRE 
protein kinase (KKIAMRE), complete cds 
/cds=(0,1481) /gb=U35146 /gi=1517819 
/ug=Hs.158512 /len=1482 

Cluster Incl. AF010310:Homo sapiens p53 

induced protein mRNA, partial cds 
/cds=(0,761) /gb=AF010310 /gi=2415296 




thyrotropin- 
releasing 
hormone 



3q13.3-q21 



interleukin 18 
receptor 
accessory 
protein 

cyclin- 
dependent 
kinase-tike 2 
(CDC2-re!ated 
kinase) 

proline oxidase 
homolog 



2p24.3- 
p24.1 
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/ug=Hs.21 1605 /len=888 

Cluster Incl. AL049651 .Human DNA 

sequence from clone 753D10 on 
chromosome 20 Contains genes for 
341 39_ SSTR4(somatostatin receptor 4) and 
at THBD(thrombomodulin), ESTs, STSs, GSSs 
and CpG islands /cds=(98, 1264) 
/gb=AL049651 /gi=4741619 /ug=Hs.226015 
/len=1427 



34800. 
at 



Cluster Incl. AL039458:DKFZp434N0910_s1 
Homo sapiens cDNA, 3 end 
/clone=DKFZp434N091 0 /clone_end=3 
/gb=AL039458 /gi=5408506 /ug=Hs.4193 
/len=849 



Cluster Incl. M15059:Human Fc-epsilon 
34960_ receptor (IgE receptor)' mRNA, complete cds 
g_at (H1 07 epitope) /cds=(21 3,1 1 78) /gb=M1 5059 
/gi=1 82447 /ug=Hs.1416 /len=1530 

Cluster Incl. AB009598:Homo sapiens 
35179_ mRNA for glucuronyltransferase I, complete 
at cds /cds=(29,1 036) /gb=AB009598 

/gi=3892639 /ug=Hs.26492 /len=1441 

Cluster Incl. AC00441 0:Homo sapiens 
35426_ chromosome 19, fosmid 39554 

at /cds=(0, 1 1 96) /gb=AC00441 0 /gi=2959558 
/ug=Hs. 167352 /len=1 197 

35552_ Cluster Incl. AI041 1 80:ov77e05.x1 Homo 
at sapiens cDNA, 3 end /clone=IMAGE- 
1643360 /clone_end=3 /gb=AI041180 



somatostatin 
receptor 4 



ortholog of 
mouse integral 
membrane 
glycoprotein 
LIG-1 

Fc fragment of 
IgE, low affinity 
II, receptor for 
(CD23A) 



beta- 1,3- 
glucuronyltrans 

ferase 3 
(glucuronosyltr 
ansferase I) 

hypothetical 
protein from 
EUROIMAGE 
42353 



20p11.2 



phosphate 
cytidylyltransfer 
ase 1, choline, 



19p13.3 



11q12.2 



19p13.3 



Xp22.22 
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/gi=3280374 /ug=Hs.1 32794 /len=810 

Cluster Incl. M26326:Human keratin 18 
35766_ mRNA, complete cds /cds=(51 , 1 343) 
at /gb=M26326 /gi=1 86690 /ug=Hs.651 1 4 

/len=1412 




36021 _ 
at 



Cluster Incl. AL049409:Homo sapiens 
mRNA; cDNA DKFZp586H0919 (from clone 

DKFZp586H0919) /cds=UNKNOWN 
/gb=AL049409 /gi=4500194 /ug=Hs.44865 
/len=1419 



1 





Cluster Incl. U43959:Human beta 4 adducin 
36052_ mRNA, alternatively spliced partial cds 
at /cds=(0,938)/gb=U43959/gi=1 172145 
/ ug =Hs.4852 /len=1284 

Cluster Incl. N99340:IMAGE-20074 Homo 
36095_ sapiens cDNA /clone=IMAGE-20074 
at /gb=N99340 /gi= 1270755 /ug=Hs.7357 

/len=1110 

Cluster Incl. U51333:Human hexokinase 
36372_ (HK3) mRNA, complete cds /cds=(74,2845) 
at /gb=U51 333 /gi=1 255787 /ug=Hs.1 59237 

/len=3049 



mm 



beta isoform 



keratin 18 



lymphoid 
enhancer- 
binding factor 1 



II 



ADD2 




adducin 2 
(beta) 



DKFZP586N19 
22 protein 



36464 
at 



Cluster Incl. X94323:H.sapiens mRNA for 
'- SGP28 protein /cds=(40,777) /gb=X94323 
/gi=1213612 /ug=Hs.54431 /len=2124 



Cluster Incl. AA883870:am26f01.s1 Homo 
36657_ sapiens cDNA, 3 end /clone=IMAGE- 
at 1467961 /clone_end=3 /gb=AA883870 
/gi=2993400 /ug=Hs.75615 /len=599 



§l§tf§fl§ 



hexokinase 3 
(white cell) 



12q13 



4q23-q25 



2p14-p13 



19q13.1 



5q35.2 



specific granule 
protein (28 
kDa) 



apolipoprotein 
C-U 



6p12.3 



- 

y:'.::/Z. : :'^'.:'. rv;;^-; 



19q13.2 
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Cluster Incl. Z38026:H.sapiens mRNA for 
3671 0_ FALL-39 peptide antibiotic /cds=(1 1 ,523) 
at /gb=Z38026 /gi=558378 /ug=Hs.51 1 20 

/len=615 



Cluster Incl. M25915:Human complement 
36780_ cytolysis inhibitor (CLI) mRNA, complete cds 
at /cds=(198,1544) /gb=M25915 /gi=180619 
/ug=Hs.75106 /len=1651 



Cluster Incl. D87433:Human mRNA for 
38487_ KIAA0246 gene, partial cds /cds=(0,6639) 
at /gb=D87433 /gi=1 665760 /ug=Hs.84753 

/len=6777 

Cluster Incl. AF062534:Homo sapiens 
38975_ genethonin 1 mRNA, complete cds 
at /cds=(127,1203) /gb=AF062534 /gi=3851521 
/ug=Hs. 1 09590 /len=2340 

Cluster Incl. U03057:Human actin bundling 
39070_ protein (HSN) mRNA, complete cds 
at /cds=(111,1592)/gb=U03057/gi=458027 
/ug=Hs.1 1 8400 /len=2767 

39221_ Cluster Incl. AF004231 :Homo sapiens 
at monocyte/macrophage Ig-related receptor 
MIR-1 0 (MIR cl-1 0) mRNA, complete cds 



cathelicidin 
antimicrobial 
peptide 

clusterin 
(complement 
lysis inhibitor, 
SP-40,40, 
sulfated 
glycoprotein 2, 
testosterone- 
repressed 
prostate 
message 2, 
apolipoprotein 
J) 

hypothetical 

protein 
FU 12442 



3p21.3 



8p21-p12 



genethonin 1 4q24-q25 



singed 
(Drosophila)- 
like (sea urchin 
fascin homolog 
like) 

leukocyte 
immunoglobulin 
-like receptor, 



7p22 



19q13.4 
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/cds=(208,2001) /gb=AF004231 /gi=2343110 
/ug=Hs.22405 /len=2863 



39307_ 
s at 



39775_ 
at 



40282. 
s at 



40365. 
at 



Cluster Incl. X81637:H.sapiens clathrin light 
chain b gene /cds=UNKNOWN /gb=X81637 
/gi=963046 /ug=Hs.239782 /len=5938 

Cluster Incl. X54486:Human gene for C1- 
inhibitor /cds=(60,1 562) /gb=X54486 
/gi=29534 /ug=Hs.151242 /len=1827 

Cluster Incl. M84526:Human 
adipsin/complement factor D mRNA, 
complete cds /cds=(54,740) /gb=M84526 
/gi=1 78625 /ug=Hs.1 55597 /len=1071 



Cluster Incl. M63904:Human G-alpha 16 

protein mRNA, complete cds 
/cds=(219,1343) /gb=M63904 /gi=1 82891 
Vug=Hs.73797 /len=2060 



40763. 
at 



41045. 
at 



41448 



cds /cds=(65,1237) /gb=U85707 
/gi=2058550 /ug=Hs.170177 /len=251 1 

Cluster Incl. U77643:Homo sapiens K12 
protein precursor mRNA, complete cds 
/cds=(1 1 8,864) /gb=U77643 /gi=2062390 
/ug=Hs.95655 /len=2000 



clone DJ0170O19 from 7p15-p21 



- Z 


subfamily B 
(with TM and 


. . ' • - 


TIM domains), 
member 2 


r'" 

' ' :<>'■'•</,' 't's/f'i 


clathrin, light 
polypeptide 






;/" : - , // > < y V 


complement 


SERPfNGi 

, t > ; 


component 1 1 
inhibitor 








D component 
nf comDlement 




(adipsin) 




guanine 




nucleotide 


* , - 

;--&NA15 


binding protein 




(G protein), 




alpha 15(Gq 
class) 


; pti 

MEIS1 


: 
- 

| Meisl (mouse) 




\ homolog 


^SlECTMl 


1 

I secreted and 
i transmembrane 

i 1 

:•• 






Z .. HOXA1 0 


Ihomeo box A10 

1 




i 



4q21-qter 



I1q12-q13.1 



19p13.3 



19p13.3 



2p14-p13 



17q25 



7p15-p14 
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at /cds=(0, 1 247) /gb=AC004080 /gi=28221 64 
/ug=Hs. 1 1 0637 /len=1 248 



41815. 
at 



Cluster Incl. AL080133:Homo sapiens 
mRNA; cDNA DKFZp434G173 (from clone 

DKFZp434G173) /cds=(1 22,3400) 
/gb=AL080133 /gi=5262573 /ug=Hs.57749 
/len=4307 




synaptic nuclei 
expressed 1 4q23.2 
gene 2 
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Table 28a 



classes BM-t(8;21) BM - BM - BM - BM-AML 

t(15;17) inv(16) t(11q23)/ML 

L 



accuracy 


1.00 


1.00 


100 


1.00 


t 

1.00 


prediction 
strenght 


0.88 


0.91 


0.99 


0.89 


0.89 


Symbol 


P(g.c) 

values 


P(g,c) 

values 


P(g,c) 

values 


P(g.c) 

values 


P(g,c) 

values 


EPOR 




7.36 


7.36 




2.74 


unknown cDNA* 


■ 


-6.46 








unknown cDNA* 




-2.76 




















LENG4* - 

■ - 

• 




-2.92 








CTSE 




3.35 




2.51 


2.39 


PGLYRP 




7.39 








CSPG2 




7.39 








H2AFN -\i 








-5.02 




TRH 


-2.12 
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IL*8RARr «fe 










4.79 




CDKL2 

> * - 

ill 










-3.12 








PSG6* : {/[\ 






-2.92 








> 

SSTR4 

- - . - ' 

'xf' ••-•:?!"' \* : "•• :"•> ?. : 


■ 






16.61 








Illta 


IIS 


WW-Z\ 

hi 










3.06 


- * 

it 




4.72 








B3GAT3 


'Ws^s% 






-2.04 








LOC56928* 








-8.39 


-1.70 


PGYT1B 




3.56 










KRT1£ 

. 






- 


-3.02 








LEF1 


<0SS^S : 

lllli 


V. 


111! 








3.26 


3.26 


ADD2 


V -V: - 




5.63 








DKFZP58 


6N192 






-8.84 

» 






HK3 

£ ~: - 


we ■ 




6.55 








SGP28* . 

; 


life 
' t 


t; 


i 








3.04 


:: «s;s:':::sj. v"> . 

APOC 

- 


11 

ill 


.. ^ •: - , 


1 


-5.33 




-4.16 




CAMF 

'^^'^ 


>/; 

Mi 


;• >• 

-W - : . 




1 


6.40 










y . ,t 


1 


-2.71 
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FLJ12442" 

" 


— r 


-2.16 








GENX-3414* 








1.97 




SNL 




-3.64 








LILRB2 




9.75 






• 


CLTB 

■ 








-4.38 




SERPING1 




-2.48 








DF 




-2.95 






-1.73 


GNA15 




-2.69 








MEIS1 








-4.22 




SECTIV11 




7.07 








HOXA10 


\ ■ 






-3.18 




SYNE-2* 








3.36 
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Table 29 



Analysis of the listed 14 leukemia subgroups and normal 
bone marrow (BM) according to the method as described by 
Golub et al. in pairwise comparison 



N 

ALLt(4;11) 9 

ALLt(8;14) 4 

ALL B not Ph 9 

ALL Ph 15 

T-ALL 9 

AML+8 10 

AML complex 36 

AML normal 62 

AMLt(8;21) 13 

AMLt(15;17) 20 

AML inv(16) 12 

AML MLL 15 

CLL 32 

CML 14 

normal BM 9 
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ALL t(4;11) vs. all other 


samples: 9 / 260 






accuracy 


1 






confidencG 


0.983255511396901 




decision limit gene symbol 


gene 


signal-to-noise 


P 


215925_s_at 


2. 1521*111 9031 41 3 


0 


NRM 


225592 at 


2.10479676639873 


0 


205821_at 


2.06781828794101 


0 


D12S2489E 


209168_at 


2.03682741085015 


0 


GPM6B 


225563_at 


2.0365845909197 


0 




209170_s_at 


1.99341681464758 


0 


GPM6B 


219033_at 


1.98127277039877 


0* 


3296.75 FLJ21308 


227407_at 


1 .980664540031 1 


0* 


2368.95 


226496 at 


1.94883167321783 


0 




219463_at 


1.92861464656998 


0 


C20orf103 


203796 s at 


1.90484126349741 


0 


BCL7A 


210934_at 


1.78273564893858 


0 


BLK 


221969_at 


1.72749112194125 


0 


PAX5 


239393_at 


1.72516948033426 


0 




238750_at 


1.71981673787555 


0 




23921 4_at 


1.67607810398359 


0* 


645.75 


218469_at 


1 .65073049955565 


0 


CKTSF1B1 


226244_jat 


1 .64755028976206 


0 




218384_at 


1.64357646019602 


0 


CRHSP-24 


20981 5_at 


1 .64023870547228 


0 


PTCH 



794 

BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 
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t(4;11)vs. t(15;17) 

accuracy 

confidence 

gene 

221969_at 
207697_x_at 
203948_s_at 
22491 8_x_at 
203949_at 
231736_x_at 
235101_at 
I 202481 _at 
219463_at 
201540_at 
238583_at 
205382_s_at 
226878_at 
210934_at 
226545_at 
38487_at 
220798_x_at 
232201_at 
204069_at 
244261 at 



samples: 9 / 20 

1 

1 

signal-to-noise p 

5.83664657385464 0* 

5.35854600658567 0 

-5.11890143327711 0 

-4.9355306862525 0 

-4.8364973202297 0 

-4.46563746824662 0 

4.28876186377802 0 

4.05195275593644 0 

3.9253737114322 0 

3.81769665767171 0 

-3.81099135622948 0 

-3.80101585016411 0 

3.74369099536436 0 

3.72127171897839 0 

3.69021106148297 0 

-3.54247992575908 0 

-3.50808230625528 0 

3.50091671488931 0 

3.4252967216987 0 

3.40910490910101 0 



decision limit 
1301 



gene symbol 

PAX5 

LILRB2 

MPO 

MGST1 

MPO 

MGST1 

KIAA1014 

SDR1. 

C20orf103 

FHL1 

DF 

BLK 

FLJ 12442 
FLJ11535 
NKD2 
MEIS1 
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t(4;11)vs. inv(16) 

accuracy 

confidence 

gene 

225653_at 

221969_at 

231259_s_at 

203949_at 

200951 _s_at 

219463_at 

203948_s_at 

200953_s_at 

217979_at 

210934_at 

20421 4_s_at 

203973_s_at 

232201_at 

216860_s_at 

244261_at - 

201360_at 

226496_at 

238824_at 

217966_s_at 

38340 at 



samples: 9/12 

1 

1 

signal-to-noise p 

-6.57614058007186 0* 

5.83664657385464 0 

-5.6800993536981 9 0 

-4.47845706407372 0 

-4.24994685464806 0 

3.9253737114322 0 

-3.82832616186979 0 

-3.81507848947069 0 

3.7506313191621 0 

3.72127171897839 0 

-3.64249678228396 0 

-3.54203246324105 0 

3.50091671488931 0 

3.46199526723217 0 

3.40910490910101 0 

-3.40410310063102 0 

3.36372190983709 0 

3.31690199423555 0 

-3.29280071064851 0 
2.62428006100593 0 



decision limit 

129.2 



gene symbol 

PAX5 

CCND2 

MPO 

CCND2 

C20orf103 

MPO 

CCND2 

NET-6 

BLK 

RAB32 

CEBPD 

NKD2 

GDF11 

CST3 



C1orf24 
HIP12 
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ALL t(4;H) vs. AML MLL 

accuracy 

confidence 

gene 

221969_at 

226795_at 

205821_at 

201360_at 

244261_at 

217979_at 

20421 5_at 

211404_s_at 

217223_s_at 

206255_at 

214875_x_at 

201828_x_at 

208702_x_at 

208456_s_at 

219229_at 

210487_at 

210192_at 

226496_at 

203796_s_at 

212207 at 



samples: 9/15 

1 

1 

signal-to-noise p 

4.924403129677 0* 

4.12248256776444 0 

4.0545821 7339808 0 

-3.44934876199386 0 

3.40910490910101 0 

3.33945130505483 0 

3.27742574573237 0 

-3.1044343641572 0 

3.09813118817326 0 

3.06828641368978 0 

-2.95163557135368 0 

-2.94392957130376 0 

-2.9259192302265 0 

2.83112443648876 0 

-2.781 06365052986 0 

2.65572482698711 0 

2.63764031456982 0 

2.63678458715383 0 

2.63294411410401 0 

2.62364024475935 0 



decision limit 

1534.5 



gene symbol 
PAX5 

D12S2489E 
CST3 

NET-6 

MGC4175 

APLP2 

BLK 

APLP2 

CXX1 

APLP2 

RRAS2 

SLC21A11 

DNTT 

ATP8A1. 

BCL7A 
KIAA1025 



797 
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ALL t(4;11)vs. CLL 

accuracy 

confidence 

gene 

210045_at 

204798_at 

225592_at 

202503_s_at 

219463_at 

201540_at 

226545_at 

223276_at 

209267_s_at 

201416_at 

22471 0_at 

218384_at 

209365_s_at 

219869_s_at 

218942_at 

209825_s_at 

201417_at 

238022_at 

212281_s_at 

204069_at 



samples: 9 / 32 

1 

1 

signal-to-noise p 

6.767348531 84964 0* 

4.99101493214162 0 

4. 1 768659491 6951 0 

4.02396247600866 0 

3.9253737114322 0 

3.7195174107357 0 

3.4985271993093 0 

3.36058308258119 0 

3.33868760455037 0 

3.27673886279907 0 

3.27404618682988 0 

3.26299093107225 0 

3.22987740485106 0 

3.18151227716348 0 

-3.1335063798458 0 

3.12344649016037 0 

3.10175892236352 0 

3.08645149251292 0 

3.07915047956129 0 

3.04751039522855 0 



decision limit 

142.15 



gene symbol 

IDH2 

MYB 

NRM 

KIAA0101 

C20orf103 

FHL1 

NID67 

LOC64116 

SOX4 

RAB34 

CRHSP-24 

ECM1 

LOC64116 

FLJ22055 

UMPK 



MAC30 
MEIS1 
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ALLt(4;11) vs. CML 

accuracy 

confidence 

gene 

221969_at 

207000_s_at 

212484_at 

201485_s_at 

20421 4_s_at 

203796_s_at 

218223_s_at 

226795_at 

205821_at 

221755_at 

219463_at 

202332_at 

210254_at 

20961 9_at 

226878_at 

205557_at 

225713_at 

206440_at 

232201 _at 

204215 at 



samples: 9/14 

1 

1 

signal-to-noise p 

5.83664657385464 0* 

4.65703873448273 0 

4.34689222808594 0 

4.25216281704109 0 

-4.24363288728321 0 

4.18407933075926 0 

4.16804739438629 0 

4.12248256776444 0 

4.05458217339808 0 

3.99463418568195 0 

3.9253737114322 0 

3.87997129184981 0 

-3.855?4735854191 0 

3.77477243739775 0 

3.74369099536436 0 

-3.71054014877636 0 

3.59398617666668 0 

-3.54851691296324 0 

3.50091671488931 0 

3.43706276144109 0 



decision limit 
1301 



gene symbol 

PAX5 

PPP3CC 

MTVR 

RCN2 

RAB32 

BCL7A 

LOC51177 

D12S2489E 

C20orf103 
CSNK1E 

CD74 

BPi 

KIAA1898 
VELI1 
NKD2 
MGC4175 



799 
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ALL 1(4*11) vs. normal BM 


samples: 9/9 










1 








r^nnf i ri p n CP. 


1 










signal-to-noise 


p 


decision limit 


gene symbol 


901898 y at 


-6.82241973614495 


0* 


285.3 


CXX1 


910045 at 


6.76734853184964 


0 




1DH2 


914950 at 


6.24435639089538 


0 






9^74^1 at 


5.31661820706767 


0 






995799 at 


-5.04037264761662 


0 






905624 at 


-4.8793192574965 


0 




CP A3 


901 540 at 


4.79436840204501 


0 




FHL1 


904914 s at 


-4.53807365704691 


0 




RAB32 


oyTo__ai 


4 52409549429394 


0 




FLJ23436 




-4 29237917192722 


0 




SCD 


9H^7QR <i at 


4. 1 8407933075926 


0 




BCL7A 


oneiric*! 0 


-4 13944949609416 


0 




KIT 


202332_at 


4.13816009401715 


0 




CSNK1E 


228176_at 


-4.13090953290361 


0 






226795_at 


4.12248256776444 


0 






208754_s_at 


4.00220842620578 


0 




NAP1L1 


221755_jat 


3.99463418568195 


0 






228424_at 


-3.92813180248343 


0 




NAALADASEL 


219463_at 


3.9253737114322 


0 




C20orf103 


212967_x at 


3.92366867855542 


0 




NAP1L1 



1 
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ALL t(4;11)vs. ALL t(8;14) 


samples: 9 / 4 




accuracy 


1 




confidence 


1 




gene 


signal-to-noise 


P 


210045_at 


6.767348531 84964 


0* 


240106_at 


6.10571301118426 


0 


202853_s_at 


-5.88804457870992 


0 


242434_at 


-5.60754470569171 


0 


237431_at 


5.31661820706767 


0 


201540_at 


4.94655082712075 


0 


215855_s_at 


4.71299810202736 


0 


212357_at 


4.18767818184794 


0 


204798_at 


4.12313508850913 


0 


226795_at 


4.12248256776444 


0 


77508_r_at 


4.0317985345148 


0 


46142_at 


3.95748459279267 


0 


225277_at 


3.7552029934786 


0 


210934_at 


3.72127171897839 


0 


215537_x_at 


3.53324247477066 


0 


232201 _at 


3.50091671488931 


0 


214505_s_at 


3.42260996379197 


0 


244261_at 


3.40910490910101 


0 


20861 4_s_at 


3.34565043119022 


0 


210298_x_at 


3.30171221592859 


0.01 



decision limit 

142.15 



gene symbol 
IDH2 

RYK 



FHL1 

KIAA0280 
MYB 

FLJ23282 
FLJ12681 

BLK 

NKD2 
FHL1 

FLNB 
FHL1 



801 
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ALL t(4;11) vs. ALL B not Ph 

accuracy 
confidence . 
gene 

237431_at 

219033_at 

219463_at 

204069_jat 

201105_at 

200907_s_at 

24241 4_at 

222492_at 

230441_at 

235291_s_at 

225592_at 

200906_s_at 

201153_s_at 

201152_s_at 

241985_at 

213894_at 

243756_at 

225563_at 

232231 _at 

240581 at 



samples: 9 / 9 

1 

1 

signal-to-noise p 

5.31661820706767 0* 

3.19032095561144 0 

2.69567768562793 0 

2.54127866831197 0 

2.30596776500018 0 

2.19034049161844 0 

2.08870062415486 0 

-2.04226084466602 0 

-2.03717805375485 0 

2.0210425168076 0 

2.01329642963674 0 

1.9632986862999 0 

1.93254941630797 0 

1.93227192981893 0 

-1.91875000661653 0 

1.89533012552966 0 

1.82938790694615 0 

1.82046495626766 0 

1.8138577510169 0 

1.80942575017411 0 



decision limit 

49 



gene symbol 

FLJ21308 

C20orf103 

MEIS1 

LGALS1 

KIAA0992 

FLJ21324 



NRM 

MBNL 
MBNL 

KIAA0960 
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ALL t(4;11)vs. ALL Ph 

accuracy 

confidence 

gene 

204069_at 

219033_at 

219463_at 

221969_at 

201874__at 

209170_s_at 

233500_x_at 

205899_at 

24241 4_at 

205821 _at 

205055_at 

209168_jat 

226939_at 

209354_at 

200906_s_at 

225563_at 

237431 _at 

3421 0_at 

202853_s_at 

209167 at 



samples: 9/15 

1 

1 

signal-to-noise p 

3.31118886883646 0* 

3.24786485857293 0 

2.78353259146178 0 

2.4416835946504 0 

2.38057857279198 0 
2.31526351178702 . 0 

2.23629715560156 0 

2.23375311954146 0 

2.21567202901383 0 

2.10144186601662 0 

2.04951650013049 0 

2.02899949049115 0 

1.99568402107224 0 

-1.9938191217443 0 

1.9632986862999 0 

1.88712595721941 0 

1.87880985981148 0 

-1.86187594200254 0 

-1.84984316383277 0 

1.84829764568262 0 



decision limit 

482.55 



gene symbol 

MEIS1 

FLJ21308 

C20orf103 

PAX5 

FLJ21047 

GPM6B 

LLT1 

CCNA1 

D12S2489E 

ITGAE 

GPM6B 

TNFRSF14 



CDW52 

RYK 

GPM6B 
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ALLt(4;11) vs. T-ALL 

accuracy 

confidence 

gene 

221969_at 

237431_at 

213772_s_at 

219463_at 

210934_at 

242292_at 

226496_at 

244261 _at 

205640_at 

219033_at 

209168_at 

205821 _at 

235706_at 

225592_at 

22531 4_at 

209170_s_at 

215925_s_at 

204069_at 

210192_at 

217080_s_at 



samples: 9 / 9 

1 

1 

signal-to-noise p 

5.83664657385464 0* 

5.31661820706767 0 

4.80540935532116 0 

3.9253737114322 0 

3.72127171897839 0 

-3.53231266693437 0 

3.49024496949904 0 

3.40910490910101 0 

3.04306419050975 0 

3.03434195473282 0 

3.02932118514235 0 

2.9956567374574 0 

2.99518505957364 0 

2.98114633774562 0 
-2.87525509857994 ■ 0 

2.832580891342 0 

2.79328349720245 0 

2.67025815490556 0 

2.63764031456982 0 

2.60699026264913 0 



decision limit 

1301 



gene symbol 
PAX5 

GGA2 

C20orf103 

BLK 



ALDH3B1 

FLJ21308 

GPM6B 

D12S2489E 

CPM 

NRM 

GPM6B 

MEIS1 

ATP8A1 

HOMER-2B 



804 
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ALL t(4;11) vs. AML +8 

accuracy 

confidence 

gene 

201828_x_at 

221969jat 

244261 _at 

210024_s_at 

219013_at 

226496_at 

205821_at 

219229_at 

222422_s_at 

233138_at 

206255_at 

217979_at 

203796_s_at 

214761_at 

227407_at 

209365_s_at 

228379_at 

20421 4_s_at 

202626_s_at 

244876 at 



samples: 9/10 

1 

1 

signal-to-noise p 

-4.66708459109445 0* 

3.82090922998168 0 

3.40910490910101 0 

-3.20983153576119 0 

-3.19035252569463 0 

3.13934828480165 0 

3.04595286153542 0 

-3.04273361188054 0 

-3.02123752629534 0 

2.95580347805273 0 

2.76163042569406 0 

2.7584725396168 0 

2.72124988867035 0 

2.62282117562643 0 

2.55763093921665 0 

2.53044674597142 0 

2.52915026244479 0 

-2.51760565011465 0 
-2.5141028457265 0 
2.46777854429452 0 



decision limit 

226.35 



gene symbol 

CXX1 
PAX5 

UBE2E3 
FLJ21634 

D12S2489E 

SLC21A11 

MGC10924 

BLK 
NET-6 
BCL7A 
OA2 

ECM1 

RAB32 
LYN 



BNSDOCID: <WO O3039443A2J_> 



805 



WO 03/039443 



PCT/EP02/12303 



ALL t(4;1 1 ) vs. AML complex samples: 9 / 36 



accuracy 


1 




confidence 


1 




gene 


signal-to-noise 


P 


221969_at 


4.27209531 988207 


n* 
u 


210934_at 


3.72 1 27 1 7 1 897839 


n 
u 


244261 _at 


3.4091 049091 01 01 


n 
u 


219463_at 


3.31 691 046444O78 


n 
u 


206255_at 


2.78386884086493 


r\ 
u 


226496_at 


2.68550006340332 


U 


209170_s_at 


2.55250295984436 


U 


209168_at 


2.4431 1832949527 


U 


215925_s_at 


j fust's C 4 

2.3466398637351 3 


U 


226244_at 


2.33002353009868 


U 


219033_at 


2.29968992 /yoo^l 




205821_at 


2.2854186259635 


0 


232201_at 


2.05970386931189 


0 


209167_at 


2.05931372040727 


0 


244876_at 


2.04904743648645 


0 


212658_at 


-2.00156707557967 


0 


227407_at 


1.9737052609863 


0 


202853_s_at 


-1 .93859575890205 


0 


203796_s_at 


1.92961910857241 


0 


214761_at 


1.92608972571974 


0 



decision limit gene symbol 

1815.1 PAX5 
BLK 

C20orf103 
BLK 

GPM6B 
GPM6B 



FLJ21308 
D12S2489E 
NKD2 
GPM6B 

LHFPL2 

RYK 

BCL7A 

OAZ 



BNSDOCID: <WO 03039443A2_I_> 



806 



WO 03/039443 



PCT/EP02/12303 



i 



ALL t(4;11) vs. AWIL normal 


samples, y / oz 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


P 


He»eision limit 

UCvlwlvll in i ill. 


gene symbol 


221969_at 


4.5288045o39boz1 


u 


1791.55 


PAX5 


210934jat 


3.721 271 71 897839 


0 




BLK 


205821 _at 


2.953445491 yoyoo 


u 




D12S2489E 


206255_at 


2.91 97631 21 33621 


u 




BLK 


226496_at 


2. 83306 1 8848202b 


u 






215925_s_at 


2.5921 353451 b2o 


u 






244261_at 


2.54559007441 965 


U 






244876_at 


2.46777854429452 


U 






203796_s_at 


2.381 756361 57975 


U 




BCL7A 


20981 5_at 


2.31 390268701643 


U 




PTCH 


20421 5_at 


2.24841 20041 7248 


0 






227407_at 


2.2323195471751 


0 






214761_at 


2.22867492056937 


0 




OAZ 


226244_at 


2.1571079710692 


0 






225563_at 


2.13989749649066 


0 






225592_at 


2.10215542418477 


0 




NRM 


35974_at 


2.05472986731736 


0 




LRMP 


217979_at 


2.03188116697557 


0 




NET-6 


230292_at 


2.02307533754428 


0 






209168_at 


2.01872981119451 


0 




GPM6B 



BNSDOCID: <WO 03039443A2_I_> 



807 



WO 03/039443 
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ALL t(4;11) _vs..AML t(8;21) 


sampies. s?/ io 








accuracy 


i 








confidence 


1 

i 






gene symbol 


gene 


signai-io-noi5>c 


n 


decision limit 


212484_at 


a '3v4fiP.Q999 808594 


0* 


121.4 


MTVR 


221969_at 


4 9740^05^785902 


o 




PAX5 


203796_s_at 




o 




BCL7A 


219463_at 


^ 095^7^71 14322 


0 




C20orf103 


210934_at 


^ 79197171897839 


0 




BLK 


232201 _at 


^ 50091671488931 


0 




NKD2 


244261_at 


3.4091 04yuyn ui u i 


n 






204069_at 


n O/i CO /1 700QC01 78 


o 




MEIS1 


204214_s_at 


o oni^Q'il 910851 8fi 


o 




RAB32 


225592_at 


3.20ooooUDZU/ uoo 


o 




NRM 


206255_at 


3.141 40o^ 1 o /Uo / o 


n 




BLK 


212658_at 


-3.07377549400227 


0 




LHFPL2 


226496_at 


3.07152636043149 


0 






233138_at 


2.95580347805273 


0 






227041 _at 


2.93637529041438 


0 






212480_at 


2.90213688157125 


0 




KIAA0376 


203795_s_at 


2.86560617331268 


0 




BCL7A 


202853_s_at 


-2.76308096012904 


0 




RYK 


203949__at 


-2.76134156327152 


0 




MP0 


219033_at 


2.72927502272681 


0 




FLJ21308 



808 

BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



AWIL t(15;17) vs. all other 


samples: 20 / 24y 






accuracy 


1 






confidence 


0. 98409529 1 fZ(4f o 






gene 


signal-to-noise 


P 


decision limit gene symbol 


214450_at 


2. 55525 / /4o0ooo 


n 
w 


CTSW 


38487_at 


2.48122703820417 


0 


^07^ ^ FLJ 12442 


212953_x_at 


2 . 3839 0408532 1 


u 


CALR 


224794_s_at 


2.0409471 0734487 


U 


LOC51148 


221004_s_at 


2.02822882303obz 


u 


ITM3 


204150_at 


1.96620111034732 


0 


STAB1 


203948_s_at 


1 .77090499793556 


U 


15 MPO 


219837_s_at 


1 .67874985207037 


U 


C17 


205382_s_at 


1 .6344414400361 2 


r\ 
U 


DF 


241383_at 


1 .6305764545467 


U 




216032_s_at 


1 . ol 4o4 / OU^O 1 ODO 


o 


SDBCAG84 


200654_at 




o 


P4HB 


220798_x_at 


1 .58679638297009 


n 


FLJ11535 


208852_s_at 


1 .58602299942644 


0 


CANX 


203074_at 


1.57040201174593 


0 


ANXA8 


200656_s_at 


1.55688359776913 


0 


P4HB 


209344_at 


1.55490125921714 


0 


TPM4 


205624_at 


1.52362814764187 


0 


CPA3 


64942_at 


1.52358978934783 


0 




AFFX- 








HUMGAPDH/M331 97_5_at 






GAPD 


HG-U133A 


1.05449125226594 


0 



BNSDOCID: <WO 03039443A2_I_> 



809 



f 

if 



WO 03/039443 
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AML t(15;17) vs. AML inv(16) 

accuracy 

confidence 

gene 

204661 _at 

209732_at 

241742_at 

38487_at 

238022_at 

204563_at 

3421 0_at 

203535_at 

217478_s_at 

214450_at 

211991_s_at 

208306_x_at 

213779_at 

211990_at 

221004_s_at 

20931 2_x_at 

219789_at 

204425_at 

205076_j5_at 

64942_jat 



samples: 20/12 

1 

1 

signal-to-noise p 

-3.70847055085953 0* 

-3.23538966029247 0 

-3.11768531834572 0 

3.08138549900179 0 

3.0278549438122 0 

-2.99471501611954 0 

-2.99398735377828 0 

-2.97123029136408 0 

-2.93655072055469 0 

2.92945546081029 0 

-2.91096104465505 0 

-2.87060964824031 0 

2.84856846381654 0 

-2.76844422327205 0 

2.72545702224706 0 

-2.66880572066538 0 

-2.64334906817191 0 

-2.62831954360607 0 

-2.59502309617401 0 

2.28304127550384 0 



decision limit 

1899.15 



gene symbol 

CDW52 

CLECSF2 

PRAM-1 

FLJ12442 

SELL 

CDW52 

S100A9 

CTSW 

HLA-DPA1 

HLA-DRB4 

HLA-DPA1 
ITM3 

HLA-DRB1 
NPR3 
ARHGAP4 
CRA 



810 



BNSDOCID: <WO_ 



_03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



AMI t/<1C<'17) wc AMI Ml 1 
A ML 1(10,1 / J VS. MML. IVIL.L. 


cjamnlp^* 90 / 15 








accuracy 


I 








confidence 

» 


i 
i 








gene 


cLinnfll-to-noise 

9IUI |QI IW 1 Iwlww 


p 


decision limit 


gene symbol 


ZUofoz:4_at 


3 0741697102978 


0* 


2052.6 


CP A3 


3848 / _ at 


9 781 1 1206643545 


0 




FLJ12442 


20394o_s__at 


9 68497999695567 


0 




MPO 


221 UU4_S_at 


9 84^96084301972 


0 




ITM3 


200951 „s_at 


O ^9094^8981 9Q1 1 


0 




CCND2 


206761 _at 


9 ^9 C »88Q4QQ^Q666 


0 




TACTILE 


203949_ at 


O 44878880*^9808 


0 




MPO 


214651_s_at 


JO 491 Q901 ^365627 


0 




HOXA9 


200952_s__at 


9 ^Q8Q8970141848 


0 




CCND2 


64942_at 


9 98^04197^50384 


0 






200953_s_at 


^.^4o40DOZ^l 0 I UO 


n 




CCND2 


204150_at 


0 o/il^anOiRC\lQ7 

^.z4ooyuz 1 001 »/ 


0 




STAB1 


212953_x_at 


2.22414089725316 


0 




CALR 


233072_at 


2.22380780245302 


0 




KIAA1857 


2l4450_at 


2.16984309325722 


-0 




CTSW 


224794_s_at 


2.14479331910386 


0 




LOC51148 


205349_at 


2.12589700684588 


0 




GNA15 


212509_s_at 


2.11142192746438 


0 






224839_s_at 


2.07810412712239 


0 




GPT2 


210788 s_at 


2.07490438621852 


0 




LOC51635 



BNSDOCID: <WO 03039443A2_I_> 



811 



WO 03/039443 
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AMLt(15;17) vs. CLL 

accuracy 

confidence 

gene 

203949_at 

203948_s_at 

206871 _at 

226043_at 

213854_at 

22491 8_x_at 

206111_at 

200654_at 

231736_x_at 

224838_at 

211990_at 

238583_at 

221004_s_at 

212400_at 

211709_s_at 

214575_s_at 

205382_s_at 

238022_at 

212953_x_at 

38487 at 



samples: 20 / 32 

1 

1 

signal-to-noise p 

6.8835946674069 0* 

5.89121269767824 0 

4.93321162912793 0 

4.54092115946342 0 

4.34395164475566 0 

4.34201320024004 0 

4.19970662571818 0 

4.03204616783544 0 

4.00585801175558 0 

-3.92476184954236 0 

-3.92005265639026 0 

3.81099135622948 0 

3.79791980511322 0 

-3.75779674692328 0 

3.73806507086654 0 

3.66149641725929 0 

3.59180789007477 0 

3.5791572544895 0 

3.54883221957949 0 

3.51682772737691 0 



decision limit 

5834.85 



gene symbol 

MPO 

MPO 

ELA2 

AGS3 

SYNGR1 

MGST1 

RNASE2 

P4HB 

MGST1 

HLA-DPA1 

ITM3 

SCGF 
AZU1 
DF 

CALR 
FLJ 12442 



812 



BMSDOCID: <WO__ 



_03039443A2J_> 



WO 03/039443 PCT/EP02/1 2303 



AMLt(15;17) vs. CML 

accuracy 

confidence 

gene 

201029_s_at 

212531_at 

206676_at 

216379_x_at 

209771_x_at 

207269_jat 

205557_at 

20201 8_s_at 

38487_at 

211657_at 

205382_s_at 

220798_x_at 

209772_s_at 

221004_s_at 

203535_at 

205863_at 

204174_at 

203757_s_at 

225386_s_at 

64942 at 



samples: 20/14 

1 

1 

signal-to-noise p 

4.80190269912041 0* 

-4.73996757450677 0 

-4.32531789709243 0 

-4.27706524116979 0 

-4.21950511119586 0 

-3.91601606801308 0 

-3.83288724786037 0 

-3.78757135021052 0 

3.67170603901023 0 

-3.58933972989264 0 

3.52725281328905 0 

3.50808230625528 0 

-3.45628903280328 0 

3.42707462770742 0 

-3.38406746753723 0 

-3.37594312179774 0 

-3.3757799683273 0 

-3.33946937382462 0 

-3.31814635286046 0 

2.28304127550384 0 



decision limit 

4487.95 



gene symbol 
MIC2 
LCN2 
CEACAM8 

CD24 
DEFA4 
BPI 
LTF , 
FLJ12442 

DF 

FU11535 

CD24 

ITM3 

S100A9 

S100A12 

ALOX5AP 

CEACAM6 

LOC92906 



BNSDOCID: <WO 03039443A2_I_> 



813 



WO 03/039443 
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1 



ANIL t(15;17) vs. normal BWI 

accuracy 

confidence 

gene 

225792_at 

20201 8_s_at 

223280_x_at 

212531_at 

203535_at 

205382_s_at 

224356_x_at 

201029_s_at 

219471_at 

223391_at 

225897_at 

209831_x_jat 

230526_at 

221004_s_at 

38487_at 

207697_x_at 

203645_s_at 

201506_at 

239278_at 

201640_x_at 



samples: 20 / 9 

1 

1 

signal-to-noise P 

-5.04037264761662 0* 

-5.02314565371121 0 

-4.43579892206636 0 

-4. 09661 809496392 0 

-3.974353553061 96 0 

3.87917813251128 0 

-3.8624083855044 0 

3.64114093559717 0 

-3.59149059254273 0 

-3.55822107244772 0 

-3.53074009879907 0 

3.47754001947329 0 

-3.32964763857134 0 

3.30073847019088 0 

3.26067308155523 0 
-3.24955398927113 0 
-3.22408073656527 0 
-3.1496662320639 0 
-3. 1 4559842984954 0 
3.1436907617525 0 



decision limit 

76.45 



gene symbol 
LTF 

MS4A6A 
LCN2 
S100A9 
DF 

MS4A6A 
MIC2 
FLJ21562 
LOC81537 

DNASE2 

FLJ20015 

ITM3 

FLJ12442 

LILRB2 

CD163 

TGFBI 

CLPTM1 



BNSDOCID: <WO 03039443A2J_> 



814 
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ANIL t(15;17) vs. ALL t(8;14) 

accuracy 

confidence 

gene 

212400_at 

239835_at 

213854_at 

209765_at 

212953_x_at 

220798_x_at 

202723_s_at 

205382_s_at 

205059_s_at 

38487_at 

235823_at 

214450_at 

201640_x_at 

242774_at 

205624_at 

221004_s_at 

202262_x_at 
. 227525_at 

227173_s_at 

46142 at 



samples: 20 / 4 

1 

1 

signal-to-noise p 

-6.34509116618667 0* 

-4.64446812677972 0 

4.34395164475566 0 

-3.90392395971701 0 

3.6338360358333 0 

3.50808230625528 0 

-3.43565959824609 0 

3.38600016039015 0 

3.37067676532123 0 

3.30566285802216 0 

-3.22388670008998 0 

3.19435746487831 0 

3.1436,907617525 0 
-3.10884740483827 0 
3.08135190030361 0 
3.04980072799641 0 
3.00631284337625 0 
-2.99175282480629 0 
-2.97620214103741 0 
2.45069282884032 0 



decision limit 

124.35 



gene symbol 

KIAA1842 

SYNGR1 

AD AM 19 

CALR 

FLJ11535 

FOX01A 

DF . 

IDUA 

FLJ12442 

CTSW 

CLPTM1 

SYNE-2 

CP A3 

ITM3 

DDAH2 

LOC1 13263 

BACH2 

FLJ12681 



BNSDOCID: <WO 03039443A2J_> 



815 
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•4 



ANIL t(15;17) vs. ALL B not Ph 

accuracy 

confidence 

gene 

205382_s_at 

2t4450_at 

220798_x_at 

209831 _x_at 

212953_x_at 

231736_x_at 

211990_at 

241383_at 

22491 8_x_at 

205624_at 

38487_at 

200654_at 

221004_s_at 

208689_s_at 

221739_at 

203679_at 

217716_s_at 

208852_s_at 

220744_s_at 

55093 at 



samples: 20 / 9 

1 

1 

signal-to-noise p 

3.94103663553541 0* 

3.7272017152315 0 

3.50808230625528 0 

3.47754001947329 0 

3.01141919123014 0 

2.98271279100376 0 

-2.92818453065756 0 

2.90001109867359 0 

2.89999004151056 0 

2.84403833450845 0 

2.72794873611638 0 

2.71316195189784 0 

2.68426024071363 0 

2.62363368134519 0 
2.61870113363546 - 0 

2.54794444503946 0 

2.27011720191064 0 

2.24073987327423 0 

2.21509524546031 0 

1.63667373135171 0 



decision limit 

3034.4 



gene symbol 
DF 

CTSW 
FLJ 11535 
DNASE2 
CALR 
MGST1 
H LA-DP A1 

MGST1 

CP A3 

FLJ12442 

P4HB 

ITM3 

RPN2 

IL27 

IL1RL1LG 

SEC61A1 

CANX 

WDR10 

KIAA1402 



BNSDOCtD: <WO 03039443A2_I_> 



816 
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AML t(15;17) vs. ALL Ph 

accuracy 

confidence 

gene 

211990_at 

220798_x_at 

22491 8_x_at 

214450_at 

231736_x_at 

205624_at 

203373_at 

205382_s_at 

212953_x_at 

203948_s__at 

238583_at 

209732_at 

233072_at 

20961 9_at 

226545_at 

238022_at 

210487_at 

200654_at 

209831_x_at 

225790_at 



samples: 20/15 

1 

1 

signal-to-noise p 

-3.8327925790289 0 

3.50808230625528 0* 

3.50754573671859 0 

3.50155234311803 0 

3.3802502122888 0 

3.21024990481945 0 

-3.1946557461653 0 

3.14485336319038 0 

2.86696122083705 0 

2.80859584810434 0 

2.78542343697266 0 

-2.73475243434259 0 

2.63743334734262 0 

-2.55642326842901 0 

-2.50061 887472005 0 

2.48587783472615 0 

-2.48460923232205 0 

2.41521499865532 0 

2.34522293184427 0 

2.31674527318944 0 



decision limit 



41 



gene symbol 

HLA-DPA1 

FLJ 11535 

MGST1 

CTSW 

MGST1 

CPA3 

STATI2 

DF . 

CALR 

MPO 

CLECSF2 
KIAA1857 
CD74 



DNTT 
P4HB 
DNASE2 



> 



BNSDOCID: <WO 03039443A2_I_> 



817 



WO 03/039443 PCT/EP02/12303 



AMLt(15;17)vs. T-ALL 

accuracy 

confidence 

gene 

203949_at 

203948_s_at 

22491 8_x_at 

231736_x_jat 

213854_at 

205382_s_at 

206871_at 

206111_at 

242292_at 

214575_s_at 

220798_x_at 

38487_at 

200654_at 

235101_at 

208689_s_at 

212953_x_at 

204348_s_at 

201537_s_at 

214450_at 

204150 at 



samples: 20 / 9 

1 

1 

signal-to-noise p 

5.76493681986315 0* 

5.52731611865475 0 

5.05683576581082 0 

4.44324813542895 0 

4.34395164475566 0 

4.13724322163424 0 

4.09729897232645 0 

3.61313451786675 0 

-3.53231266693437 0 

3.51522052980985 0 

3.50808230625528 0 

3.40780715783254 0 

3.36034425167829 0 

-3.27592263847035 0 

3.18547214467073 0 

3.18316032369792 0 

3.06277487805438 0 

2.82382291089523 0 

2.81907045269144 0 

2.73329912927614 0 



decision limit 

6205 



gene symbol 

MPO 

MPO 

MGST1 

MGST1 

SYNGR1 

DF 

ELA2 

RNASE2 

AZU1 

FLJ11535 

FLJ12442 

P4HB 

KIAA1014 

RPN2 

CALR 

AK3 

DUSP3 

CTSW 

STAB1 
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A ML t(15;17) vs. AML+8 


samples: 20/10 








accuracy 


1 








confidence 


1 






gene symbol 


gene 


siynai-iO-iiv/io«? 


n 
r* 


decision limit 


214450_at 


3.08650869451 71 


n* 
u 


1Q96 7 


CTSW 


212953_x_at 


2.71 31 955338292b 


n 
u 




CALR 


236787_at 


2.427ooUu/ yo/ /O 


n 






200952_s_at 




o 




CCND2 


205624_at 




o 




CP A3 


38487_at 


2.ooZolbZ^:oy*t 1 /D 


o 




FLJ12442 


20561 4_x_at 


2.24846130719191 


u 




MST1 


206761 _at 


2.20501 oyyy ( oooo 


n 




TACTILE 


216320_x_at 


2. 1 Do9Uboy44oyo^ 


n 






224794_s_at 


2. 1 4479331 a 1 Uood 


n 




LOC51148 


233072_at 


2. 0772 1 746 1 byo4 / 


n 




KIAA1857 


221004_s_at 


^.UOOy^OD^OtJ/ Ov30 


o 




ITM3 


227326_at 


2.0492^504239987 


0 


i 




221980_at 


2.01752809990056 


0 






225547_at 


-1.98126007880124 


0 






210145_at 


-1.95455412087558 


0 




PLA2G4A 


212509_s_at 


1.92046303571342 


0 




TPM4 


209344_at 


1.90906303754685 


0 




201029_s_at 


1.87538506046111 


0 




M1C2 


204150_at 


1.87343002266891 


0 




STAB1 



► 
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ANIL t(15;17) 
complex 

accuracy 
confidence 



vs. 



AML 



205382_s_at 

212953_x_at 

64942_at 

214450_at 

38487_at 

224794_s_at 

220798_x_at 

216032_s_at 

203948_s_at 

209732_at 

230526_at 

238022_at 

200654_at 

204150_at 

2l3447_at 

2Q6847_s_at 

203074_at 

219837_s_at 

200931_s_at 

AFFX- 

HUMGAPDH/M331 97_5_at 
HG-U133A 



samples: 20 / 36 
1 

signal-to-noise p 

2.47024872277389 0 

2.44599456599903 0* 

2.28304127550384 0 

2.2627370518124 0 

2.15395063071356 0 

2.14479331910386 0 

2.06303065394458 0 

2.05259440043708 0 

2.04245448483567 0 

-1.90630142681759 0 

-1.90204644112897 0 

1.89121106100583 0 

1.73209407132843 0 

1 .72000809746397 0 

-1.71480861978241 0 

-1.68023930751716 0 

1.65576107663154 0 

1.65191807395586 0 

-1.6356222023809 0 



1.15678566237816 



decision limit 

4652.35 



gene symbol 
DF 

CALR 

CTSW 

FLJ12442 

LOC51148 

FLJ11535 

SDBCAG84 

MPO 

CLECSF2 
FLJ20015 

P4HB 

STAB1 

IPW 

HOXA7 

ANXA8 

C17 

VCL 



GAPD 
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ANIL t(15;17) vs. ANIL normal 

accuracy 

confidence 

gene 

212953_x_at 

214450_at 

203948_s_at 

38487_at 

224794_s_at 

233072_at 

221004__s_at 

236787_at 

209732_at 

214651_s_at 

216032_s_at 

227326_at 

200952_s_at 

208852_s_at 

200654_at 

204150_at 

64942_at 
203949_at 
235753_at 
209344_at 



samples: 20 / 62 

1 

1 

signal-to-noise P 

2.64618373092816 0* 

2.28653040002124 0 

2.28093863578688 0 

2.27897339525457 0 

2.14479331910386 0 

2.02319157581908 0 

2.01169552990941 0 

1.97410536661333 0 

-1.93278822451406 0 

-1.88045707977072 0 

1.84310066944834 0 

1.83154138034996 0 

1.79422847402715 0 

1.77896258332914 0 

1.77731955574697 0 

1.77067560905128 0 

1.71976456537364 0 

1.69386661503752 0 

-1.67072227461966 0 

1.65496211687573 0 



decision limit 

4541.1 



gene symbol 

CALR 

CTSW 

MPO 

FLJ12442 

LOC51148 

KIAA1857 

ITM3 

CLECSF2 

HOXA9 

SDBCAG84 

CCND2 
CANX 
P4HB 
STAB1 

MPO 

TPM4 
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ANIL t(15;17) vs. ANIL t(8;21) 

accuracy 

confidence 

gene 

214450_at 

38487_at 

209732_at 

204150_at 

201596_x_at 

213944_x_at 

230526_at 

212509_s_at 

211990_at 

20431 9_s_at 

20561 4_x_at 

216320_x_at 

224794_s_at 

224839_s_at 

227326_at 

238365_s_at 

228827_at 

228570_at 

205349_at 

200986_at 



samples: 20/13 

1 

1 

signal-to-noise P 

3.53862069365814 0* 

3.30334484935728 0 

-3.18277220746091 0 

2.73329912927614 0 

2.73162867034962 0 

2.5862372690463 0 

-2.42276930706474 0 

2.33481477262277 0 

-2.3270695509372 0 

-2.25060861801642 0 

2.24846130719191 0 

2.16690689445934 0 

2.14479331910386 0 

2.07810412712239 0 

2.04924504239987 0 

2.03674279873081 0 

-2.03460798747208 0 

2.03169244854036 0 

2.02142471684528 0 

2.01484061650733 0 



decision limit 

1809.95 



gene symbol 

CTSW 

FLJ12442 

CLECSF2 

STAB1 

KRT18 

FLJ20015 

HLA-DPA1 

RGS10 

MST1 

LOC51148 
GPT2 



GNA15 
SERPING1 
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AML inv(16) vs. all other samples: 12 / 257 



accuracy 


1 






confidence 


0.7 8632 161931 2236 






gene 


signal-to-noise 


P 


Horkinn limit nene svmb 


201497_x_at 


1 .748528451 85764 


0 


MYH1 1 


231310_at 


1.47164364462932 


0 




241525_at 


\ .40464277333052 


0 




233555_s_at 


1.36141951191384 


0 


Aczo n^ 


224724_jat 


1.35707926936144 


0 


i/iaa-i OA7 


200665_s_at 


1 .34209700274375 


0 


QPARp 


202370_s_at 


-1 .29245470192814 


0 


UDr D 


222862_s_at 


1.26048366428059 


0 


Al/C 


200675_jat 


-1.23914248971997 


0 


OUO l 


205076_s_at 


1 .2201 7942852008 


U 


CRA 


201496_x_jat 


1.21510481759962 


0 


ivi T n i i 


223385_at 


1.2113380867898 


0 


fVDOCI 


221486_at 


-1.191,52063013451 


0 


i 


223471_at 


-1.18312106893472 


0 




206135__at 


1.18181935093584 


0 


KIAA0535 


206956_at 


1.17922310970084 


0 


BGLAP 


235359_at 


1.16409001423194 


0 




20201 6_at 


1.15872909212242 


0* 


1 MEST 


201324_at 


1.15017496156271 


0 


EMP1 


AFFX-HUMRGE/M1 0098_5_at 








-HG-U133B 


0.737113176900438 


0 
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AML inv(16) vs. ANIL WILL 

accuracy 

confidence 

gene 

200951 _s_at 

228058_at 

219271_at 

231259_s_at 

214651_s_at 

202551_s_at 

205453_jat 

200953_s_at 

213737_x_at 

23581 8_at 

225653_at 

232636_at 

202746_j3t 

200665_s_at 

203949_at 

202552_s_ at 
22347 1_at 
201828_x_at 
235359_at 
202747_s_at 



samples: 12/15 

1 

1 

signal-to-noise p 

4.24994685464806 0* 

3.25212044058077 0 

2.57049778814556 0 

2.43255056573718 0 

-2.30388406553935 0 

2.25890379783091 0 

2.23059959679219 0 

2.19700687874039 0 

-2.16186095833837 0 

2.13054793207832 0 

2.10834669134201 0 

-2.07022186491858 0 

2.06211630393441 0 

2.03751489015447 0 

2.01364277991339 0 

1.96670486082105 0 

-1.96529988161274 0 

-1.9593150488894 0 

1.95403665761428 0 

1.9384101680124 0 



decision limit 

75.6 



gene symbol 
CCND2 

FLJ 12691 

CCND2 

HOXA9 

CRIM1 

HOXB2 

CCND2 



SPARC 

MPO 

CRIM1 

CXX1 

ITM2A 



824 



BNSDOCID: <WO_ 



_03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



AML inv(16) vs. CLL 

accuracy 

confidence 

gene 

203949_at 

211709_s_at 

203948_s_at 

231310_at 

226043_at 

201417_at 

209365_s_at 

224838_at 

243000_at 

223382_s_at 

201163_s_at 

202862_at 

231982_at 

201162_at 

218942_at 

212827_at 

22471 0_at 

208864_s_at 

211787_s_at 

205382_s_at 



samples: 12/32 

1 

1 

signal-to-noise P 

6.95021247410033 0* 

5.18681505358433 0 

4.72760268663435 0 

4.35158499850242 0 

4.31805631251135 0 

3.83165504179581 0 

3.8293529362527 0 

-3.72164043642693 0 

3.63182045730937 0 

3.56679694464951 0 
3.21223463217947 0 
3.20454480816595 0 
3.1934694742924 0 
3.15702502490111 0 
-3.1335063798458 0 
. -3.11017349120191 0 
3.04332806808412 0 
3.01671716600197 0 
2.90916234544607 0 
2.90564641349051 0 



decision limit 

3746.5 



gene symbol 
MPO 
SCGF 
MPO 

AGS3 

ECM1 



NIN283 
IGFBP7 
FAH 

IGFBP7 

FLJ22055 

IGHM 

RAB34 

TXN 

EIF4A1 

DF 
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AML inv(16) vs.CML 

accuracy 

confidence 

gene 

201029_s_at 

209365_s_at 

226844_at 

20977 1_x_at 

206440_at 

233138_at 

216379_x_at 

227749_at 

209772_s_at 

210982_s_at 

218942_at 

206676_at 

204661_at 

212531_at 

208306_x_at 

208890_s_at 

216015_s_at 

223839_s_at 

207802_at 

34210 at 



samples: 12/14 

1 

1 

signal-to-noise p 

4.31890554977499 0* 

3.8293529362527 0 

3.75460881300519 0 

-3.55889840129944 0 

-3.54851691296324 0 

3.4735123827114 0 

-3.43948341863688 0 

3.41648620394084 0 

-3.3477199485593 0 

3.24778523856649 0 

-3.24708603988632 0 

-3.24497920147189 0 

3.10482803895821 0 

-3.09841450182506 0 

3.05465357697194 0 

2.99789197915585 0 

2.9548034027314 0 

-2.95162251991881 0 

-2.93213346323135 0 

2.86220111493221 0 



decision limit 
4017 



gene symbol 

MIC2 

ECM1 

CD24 
VEL11 



CD24 

HLA-DRA 

FLJ22055 

CEACAM8 

CDW52 

LCN2 

HLA-DRB4 

PLXNB2 



SGP28 
CDW52 
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A Rill !n»/<1C\ we normal RM 

A ML inv(lo; vs. normal dwi 


Samples. \£. / & 








accuracy 


A 
I 








confidence ( 


i 








gene 


<z\ nnfl I -to-noise 


p 


decision limit 


gene symbol 


225792_ at 


040^7964761662 


0* 


76.45 




22497o_at 


^ niQ1fl096Q03577 


o 




NFIA 


22497o_at 


-A <^4nn677R733 


o 




NFIA 


23obb2_at 


4 911 165Q0 ^69832 


o 






209365__S_at 


^ R9Q^ c ;9Q'369 e i97 


o 




ECM1 


223044_at 


O ~71Q7rkOft9*3K9^17 


n 




SLC11A3 


226326_at 


-O.DyoUUoUT I U / 


n 






200832_s_at 


Q C7c;7^Q.4Q9R47nft 


o 




SCD. 


21 3288__at 


o cccocnQPi c;Afln^ 

-o.OODOOVjyO IODOVJO 


n 






230988_at 


Q /I CQyl QQ H OQQQC 1 9 

-o.4oy4oyn oyyyo iz 


n 






205382_s_at 


o.^yu/ oyouzyzooy 


n 




DF 


201417_at 


3.2768271 8704937 


U 






225897_at 


-3.212,86424500781 


0 






226299_at 


3.21097313269574 


0 




pknbeta 


210933_s_at 


3.17596755096177 


0 




MGC4655 


218094_s_at 


3.1307319856851 


0 




C20orf35 


201029_s_at 


3.10235325372561 


0 




M1C2 


210036_s_at 


-3.09612221936927 


0 




KCNH2 


212667_at 


3.07883651171274 


0 




SPARC 


40189 at 


2.32056421839438 


0 




SET 



1 
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ANIL inv(16) vs. ALL t(8;14) 


_ — ■ * / A 

samples, \£ / ** 








accuracy 


1 








confidence 


1 






gene symbol 


gene 


signal-to-noise 


P 


decision limit 


214558__at 


5.111 9881 iduotwu / 


n* 


75.15 


GPR12 


227525_at 


-4.92871 8o4b41 <to 


n 




LOC 113263 


211852_s_at 


4.7598068ooODooo 


n 
u 




ATRN 


238652_at 


4.21 1 1 boyuoozoo.*: 


o 




DISC1 


206090_s_jat 


4. 1 1 / / y^oouuo \ yy 


0 




237864_at 


3.94349147195948 


o 

0 






225051_at 


-3.899301 7022yyo4£ 


n 






209365_s_at 


3. 8293529362527 


0 






223382_s_at 


3. 5667969446430 1 


n 




NIN283 


201278_at 


« ^/"n^O COO «l C707 

3.3859235881 o7o/ 


n 
u 




DAB2 


219654_at 


3.2551 940791 4b 1 b 


n 
u 




PTPLA 


202074_s_at 




o 




OPTN 


231982_at 


3.1934694742924 


0 


• 




231310_at 


3.18344789439715 


0 






210933_s_at 


3.17596755096177 


0 




MGC4655 


222062_at 


3.12204176758762 


0 




WSX1 


242774_at 


-3.10884740483827 


0 




SYNE-2 


244257_at 


3.09069018899455 


0 




SPARC 


212667_at 


3.07883651171274 


0 




225305_at 


2.99842864962803 


0 
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AML inv(16) vs. ALL B not Ph 

accuracy 

confidence 

gene 

205382_s_at 

206772_at 

228058_at 

218094_s_at 

223385_at 

204858_s_at 

227556_at 

212463_at 

201360_at 

205997_at 

231310_at 

203973_s_at 

200872_at 

200661_at 

22741 5_at 

219358_s_at 

20201 6_at 

208248_x_at 

212188_at 

205076_s_at 



samples: 12/9 

1 

1 

signal-to-noise P 

3.44559828536398 0* 

3.36924689882826 0 

3.25212044058077 0 

3.1307319856851 0 

3.07706676910952 0 

2.96720397750981 0 

2.66258700650327 0 

-2.65372355495687 0 

2.63414688358486 0 

2.6308573171159 0 

2.60181313279404 0 

2.4714469340582 0 

2.42523459110761 0 

2.42407433324859 0 

-2.35663515228965 0 

2.31004642323513 0 

2.30727907426241 0 

2.27797183411701 0 

2.26974263089413 0 

2.257959447417 0 



decision limit 

2092.2 



gene symbol 
DF 

PTHR2 

C20orf35 
CYP2S1 
ECGF1 
ATP1B1 

CST3 
ADAM28 

CEBPD 

S100A10 

PPGB 

CENTA2 
MEST 
APLP2 
LOC1 15207 
CRA 



BNSDOCID: <WO 03039443A2_I_> 



829 



WO 03/039443 PCT/EP02/1 2303 



AML inv(16) vs. ALL Ph 

accuracy 

confidence 

gene 

231310_at 

203373_at 

210487_jat 

209735_at 

22331 4_at 

227276_at 

205382_s_at 

200872_at 

200985_s_at 

207224_s_at 

208702_x_at 

209975_at 

219452_at 

22491 8_x_at 

200661 _at 

205076_s_at 

203092_at 

217979_at 

228058_at 

210609_s_at 



samples: 12/15 

1 

1 

signal-to-noise P 

3.26954611223081 0* 

-3.07194642697668 0 

-2.57922739606335 0 

-2.57442202182339 0 

-2.46037319479876 0 

2.31171718422321 0 

2.30057319708762 0 

2.27656979916265 0 

-2.25258857041194 0 

2.24976284058658 0 

2.21820176325777 0 

2.21237181448127 0 

2.18614057488219 0 

2.18042008960618 0 

2.15820911658542 0 

2.1576215121228 0 

-2.1346943506478 0 

-2.13131555913964 0 

2.11586185512013 0 

2.1104858897821 0 



decision limit 

119.2 



gene symbol 

STAT12 

DNTT 

ABCG2 

MGC11352 

TEM7R 

DF 

S100A10 

CD59 

SIGLEC7 

APLP2 

CYP2E 

LOC64174 

MGST1 

PPGB 

CRA 

TIMM44 

NET-6 

PIG3 
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ANIL inv(16) vs. T-ALL 

accuracy 

confidence 

gene 

203949_at 

205640_at 

203948_s_at 

204484_at 

223482_at 

205382_sjat 

242292_at 

22491 8_x_at 

228058_at 

210314_x_at 

216015_s_at 

206380_s_at 

202944_at 

231736_x_at 

209500_x_at 

20531 2_at 

211495_x_at 

211101_x_at 

219763_at 

225510 at 



samples: 12/9 

1 

1 

signal-to-noise p 

5.55902965521097 0* 

4.30508511818292 0 

4.299250186967 0 

-3.92079658637056 0 

3.8867405806948 0 

3.78294993092265 0 

-3.53231266693437 0 

3.39770561886615 0 

3.25212044058077 0 

3.15624540914219 0 

2.9548034027314 0 

2.92913137540268 0 

2.90807873450708 0 

2.90529949191742 0 

2.88541455460133 0 

2.86437379660673 0 

2.85808523976773 0 

2.82479786118477 0 

2.71839322196283 0 

2.68685955542078 0 



decision limit 

4116.65 



gene symbol 

MPO 

ALDH3B1 

MPO 

PIK3C2B 

TMPIT 

DF 

MGST1 

TNFSF13 

PFC 

NAGA 

MGST1 

TNFSF13 

SPI1 

TNFSF13 
LILRA2 
KIAA1608 
CHN1 
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AML inv(16) vs. AWIL +8 


samples: 12/10 








accuracy 


1 








confidence 


1 






gene symbol 


gene 


signal-to-noise 


P 


decision limit 


233138_at 


3.4735123827114 


0* 




PPM1 

C Owl 1 


209365_s_at 


3.00986865470169 


0 




202283_at 


2.6160707896185 


0 




ocrvr iini i 


218942_at 


-2.45751 9041 92227 


u 




FLJ22055 


201828_x_at 


-2.21 365274281 998 


U 




CXX1 


200951_s_jat 


2.02357943949022 


0 






226120__at 


-2.01082485261142 


0 




I nn?3016 

LV_/w 1 twU l vj 


203188_at 


-1 .9379696222037 


U 




B3GNT6 


202085_at 


-1.9147903697218 


0 




T IP9 


206135_at 


1 .90877055638373 


U 




KIAA0535 


210024_s__at 


-1 .903309 ( oul oooo 


n 




UBE2E3 


204661_at 


1 .83776082826379 


U 




CDW52 


241525_at 


1 .83006374766269 


r\ 

u 




AP3D1 


20871 0_s_at 


-1.80896001968559 


0 




3421 0_at - 


1.75572696362264 


0 




CDW52 


201497_x_at 


1.74852845185764 


0 




MYH11 


212236_x_at 


1.74679406476776 


0 






213810_s_at 


1.69077455623534 


0 




FU 10342 


212250_at 


-1.62885302351785 


0 






AFFX-r2-Hs1 8SrRNA-5_at 










HG-U133A 


1.32018767826727 


4 0 







BNSDOCIO: <WO 03039443A2_I_> 



832 



WO 03/039443 



PCT/EP02/ 12303 



AML mV\lo) VS. MiviL. complex 


<5amnles* 12 / 36 






accuracy 


A 
\ 






confidence 








gene 


o 5 nn^l -tr>-noi ?iP 

Oly 1 |UI"IW 1 IUIOC 


n 


decision limit gene symbol 


203092_at 






TIMM44 


209190_s_at 


I. (Of ZoOHl 040 1 4 I 


n* 

u 


1593.8 DIAPH1 


201497_x_at 


1 . / 4dOiio*fon oo/ o*t 


n* 


134.75 MYH11 


205076_s_at 




n 


CRA 


241525_at 


A 71 ARO,4ft'V?'?^Q7Q 


n 




213779_at 


-1 . / 1 ODHOOZZOZOO f 


n 
u 




210982_s_at 


H QQQP7H ^1 Q 


u 


HLA-DRA 


200985_s_at 


-1 .b4o4oyyooo44»5o 


n 
u 


CD59 


212463_at 


-1 .b4o/ y4i uozf yy 


n 

u 




200675_at 


-1 .Di o4o/ ooo^o^y 


u 


707 CD81 


218942_at 


-1 .59347299102441 


u 


FLJ22055 


200984_s_at 


-1 .56833724351 535 


0 




208894_at 


1.54975491884609 


0 


HLA-DRA 


202265_at 


-1.53139324627965 


0 


BMI1 


224724_at 


1.5143785002027 


0 


KIAA1247 


210715_s_at 


-1.49004107536748 


0 


SPINT2 


213452_at 


-1.48522101377482 


0 


ZNF184 


205382_s_at 


1.45942422076027 


0 


DF 


201360_at 


1.45905524413008 


0 


CST3 


206135 at 


1.45729112913321 


0 


KIAA0535 



BNSDOCID: <WO 03039443A2_I_> 



833 



WO 03/039443 PCT/EP02/1 2303 



AML inv(16) vs. AML normal 

accuracy 

confidence 

gene 

200951_s_at 

209365_s_at 

214651_s_at 

201497_x_at 

23131 0_at 

223385_at 

206135_jat 

235753_at 

231259_s_at 

204661 _at 

202370_s_at 

213353_at 

209905_at 

200675_at 

207194_s_at 

201324_at 

23581 8_at 

22551 0_at 

228834_at 

3421 0_at 



samples: 12/62 
1 



n Q^9Q58Q7823021 2 




cinnal-to-noise 

ol^l lul Iw IIVIwV 


D 


1 89Q1 64866761 22 


o 




0* 


-1 777Qfin^0Qfi8' : V74 


0* 


1 74RR98451 85764 


o 


1 fi94Rn^fi77fi0Q99 


0* 




o 


I .O*fi70 IDUUZ I IOOO 


o 


1 ^4RQ1 44fi07R989 


o 


1 ^9R81 ^n^1 74R01 


o 


1 .46205490369508 


0 


-1.45859936190513 


0 


-1.44758735114725 


0 


-1.42239381454703 


0 


-1.38662888859944 


0 


1.38591797740996 


0 


1.36182933080382 


0 


1.35992709972839 


0 


1.35514122232308 


0 


1.35066870620531 


0 


1.33546826742244 


0 



decision limit gene symbol 

CCND2 
390.9 ECM1 
187.6 HOXA9 
MYH11 

167.8 

CYP2S1 
KIAA0535 

CCND2 

CDW52 

CBFB 

ABCA5 

HOXA9 

CD81 

ICAM4 

EMP1 

CHN1 
T0B1 
CDW52 



BNSOOCID: <WO 03039443A2_I_> 



834 



WO 03/039443 



PCT/EP02/12303 



AML inv(16) vs. AML t(8;21) 

accuracy 

confidence 

gene 

233138_at 

202283_at 

201596_x_at 

233555_s_at 

22681 8_at 

212828_at 

227276_at 

224724_at 

224764_at 

224049_at 

205453_at 

226841_at 

209975_at 

205076_s_at 

202340_x_at 

210314_x_at 

34689_at 

235359_at 

212188_at 

20571 8_at 



samples: 12/13 

1 

1 

signal-to-noise p 

3.4735123827114 0* 

2.6160707896185 0 

2.55517188589615 0 

2.48943541958708 0 

2.3621676751726 0 

2.33130605042964 0 

2.31171718422321 0 

2.30850265580909 0 

2.25934489179779 0 

2.25244855640038 0 

2.23059959679219 0 

2.21329419316194 0 

2.21237181448127 0 

2.21083117233863 0 

2.18671963481275' 0 

2.17138407196792 0 

2.16463995293403 0 

2.12423469465025 0 

2.11590883979512 0 

2.10547132123535 0 



decision limit 

54.25 



gene symbol 

SERPINF1 
KRT18 



SYNJ2 

TEM7R 

KIAA1247 

ARHGAP10 

KCNK17 

HOXB2 

CYP2E 

CRA 

NR4A1 

TNFSF13 

TREX1 

LOC1 15207 
ITGB7 



BNSDOCID: <WO 03039443A2J_> 



835 



WO 03/039443 



PCT/EP02/12303 



AML MLL vs. all other 

accuracy 
confidence 
failed: 
gene 

228083_at 

201105_at 

205849_s_at 

205472_s_at 

208702_x_at 

225700_at 

222982_x_at 

238856_s_at 

214875_x_at 

20961 6_s_at 

205471_s__at 

211404_s_at 

213857_s_at 

204951_at 

203544_s_at 

204082_at 

219360_s_at 

235604_x_at 

200742_s_at 

217520_x_at 



samples: 15/254 
0.977695167286245 
0.736329320600874 
3,6,7,11,155,212 

signal-to-noise p 

1.21941234348391 0 

1.20839678060713 0* 

1.03753805567643 0 

1.03076838857205 0 

1.01256627546813 0 

-1.01092153300039 0 

-1.00741126543277 0* 

-0.99041 5633837559 0* 

0.989164186017564 0 

0.966839890517189 0 

0.957394713027089 0 

0.942572231076861 0 

-0.933745825960995 0 

-0.925961455918277 0 

-0.922231 323262765 0 

0.910815804799754 0 

0.902459945140701 0 

-0. 8951 50647295372 0 

0.89214325035062 0 

0.891291087387708 ' 0 



decision limit 

8795.05 



1101 
1 



gene symbol 

LGALS1 
UQCRB 
DACH 
APLP2 

SLC38A2 

APLP2 

CES1 

DACH 

APLP2 

CD47 

ARHH 

STAM 

PBX3 

TRPM4 

CLN2 



836 



BNSDOCID. <WO__ 



_03039443A2_t_> 



WO 03/039443 



PCT/EP02/12303 



AML MLL vs. CLL 


samples: 15/32 








accuracv 


1 








confidencG 


1 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


224838__at 


-3.80748815117863 


0* 


1810.6 




212400 at 


-2.86025920523167 


0 






212827 at 


-2.83572202473321 


0 




IGHM 


223514 at 


-2.79698500849851 


0 




CARD11 


207168_s_at 


2.78000663400662 


0 




H2AFY 


208456 s at 


-2.77288752188536 


0 




RRAS2 


206111 at 


2.74648404080317 


0 




RNASE2 


204951 at 


-2.73367910105826 


0 




ARHH 


204215 at 


-2.70725768797715 


0 




MGC4175 


41220 at 


-2.6474489391 5855 


0 




MSF 


227173_s_at 


-2.60156567650878 


0 




BACH2 


211404_s_at 


2.58701495228114 


0 




APLP2 


201163 s at 


2.58627447208644 


0 




IGFBP7 




-2.5720529437702 


0 






202880_s_at 


-2.55714780536776 


0 




PSCD1 


224837_at 


-2.52465899436623 


0 




FOXP1 


209374_s_at 


-2.50451803785685 


0 




IGHM 


243780_at 


-2.49073709912546 


0 






212590_at 


-2.45765382458849 


0 






AFFX- 










HUMGAPDH/M331 97_3_at 










HG-U133B 


2.15372303795555 


0 




GAPD 



837 
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AML WILL vs. CWIL 

accuracy 

confidence 

gene 

20977 1_x_at 

216379_x_at 

212531_at 

206676_at 

205557_at 

209772_s_at 

211657_at 

203757_s_at 

20201 8_s_at 

20551 3_at 

207269_jat 

207802_at 

236908_at 

266_s_at 

201105_at 

208890_s_at 

210244_at 

214875_x_at 

236979_at 

49306 at 



samples: 15/14 

1 

1 

signal-to-noise p 

-4.47855739740896 0* 

-4.47782401977517 0 

-4.20914790815671 0 

-4.08016979869995 0 

-3.89666780981116 0 

-3.7540800038095 0 

-3.35563391812807 0 

-3.27728001846237 0 

-3.2413562747036 0 

-3.11724133997788 0 

-3.04676950166614 0 

-2.84753376114576 0 

-2.83593202717857 0 

-2.75915771343774 0 

2.73676787442809 0 

2.63909820141829 0 

-2.58291056673136 0 

2.54486223713713 0 

-2.51234802423843 0 

1.35686961755716 0 



decision limit 

4909 



gene symbol 
CD24 

LCN2 
CEACAM8 
BPI 
CD24 

CEACAM6 

LTF 

TCN1 

DEFA4 

SGP28 

CD24 

LGALS1 

PLXNB2 

CAMP 

APLP2 

AD037 



838 



BNSDOCID: <WO_ 



_03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



A Hill UNI 1 w*» n/trmal DM 

A ML MLL VS. normal divi 


samDles* 15/9 








accuracy 


i 








confidence 


i 






gene symbol 


gene 


c:innal-to-noisG 


p 


decision limit 


225792_at 


-5 04037264761662 


0* 


76.45 




22781 2_at 


_d ^0179833847856 


0 






236908_at 


-h. odd uyyyiz/ o i 


n 






225700_at 


-d 90778530270981 


o 






20201 8_s_at 




o 




LTF 


225804_at 


^ 7ft4^RR5Q^9981 


o 






212531_at 


^ R91 RQR^fifl88899 


o 




LCN2 


208651 _x_at 


-3.081 31 561 292721 


n 




CD24 , 


209771 _x_at 


o «nc;^i 449^0^7^7 


o 




CD24 


216379_x_at 


-2 . 747 1 8662648409 


U 




CAMP 


210244_at 


-2 . 74600903oo3boy 


n 
u 




226301 _at 


-9 70321745971481 


0 




dJ55C23.6 


227151_at 


2.68181697471281 


0 




AKR1C3 


209160_at 


-2.64474285212333 


0 




225285_at 


-2.56569832423798 


0 






23581 8_at 


-2.5546546541523 


0 






226726_at 


-2.52741579223109 


0 




LOC 129642 


207269_at 


-2.43598973973471 


0 




DEFA4 


226751_at 


-2.41273912703748 


0 




DKF2P566K1924 


58780_s_at 


1 .24593648437062 


0 




FLJ 10357 



1 



BNSDOCID: <WO 03039443A2_I_> 



839 



WO 03/039443 
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AML WILL vs. ALL t(8;14) 

accuracy 

confidence 

gene 

227173_s_at 

225700_at 

212400_at 

227151_at 

205690_s_at 

231552_at 

239647_at 

239835_at 

226435_at 

226301_at 

226607_at 

215785_s_at 

225085_at 

214651_s_at 

209765_at 

236606_at 

218319_at 

227525_at 

212538_at 

228083 at 



samples: 15/4 

1 

1 

signal-to-noise p 

-2.97620214103741 0* 

-2.97113581211542 0 

-2.91449782508721 0 

2.68181697471281 0 

-2.63484847538673 0 

2.60340424456959 0 

2.56381751217569 0 

-2.49692099181388 0 

2.38716130768923 0 

-2.38404537685874 0 

2.34518225140996 0 

-2.31901965008779 0 

2.24977026547419 0 

2.24855870756181 0 
-2.20363939375933 - 0 

-2.19711183569464 0 

-2.1925818315627 0 

-2.16383106825606 0 

-2.11358391570231 0 

2.10218682980016 0 



decision limit 

93.35 



gene symbol 
BACH2 



G10 



KIAA1842 

dJ55C23.6 

L3MBTL 

CYFIP2 

HOXA9 
AD AM 19 

PELI1 

LOC1 13263 
K1AA1058 



840 



BNSDOCID: <WO_ 



_03039443A2_I_> 



WO 03/039443 



PCT/EP02/ 12303 



ANIL WILL vs. ALL B not Ph 

accuracy 

confidence 

gene 

201482_at 

201105_at 

239647_at 

211404_s_at 

201360_at 

214875_x_at 

266_s_at 

208702_x_at 

223469_at 

209771 _x_at 

208248_x_at 

230441 _at 

200742_s_at 

216379_x_at 

20421 5_at 

208650_s_at 

214651_s_at 

23381 3_at 

208703_s_at 

58780 s at 



samples: 15/9 

1 

1 

signal-to-noise p 

2.87636338319455 0* 

2.64910601922115 0 

2.56381751217569 0 

2.31189269578112 0 

2.27456235196276 0 

2.20140608739693 0 

-2.19196810995278 0 

2.17966631844281 0 

2.13015217253256 0 

-2.12750959401045 0 

2.10716186347527 0 

-2.03717805375485 0 

2.01183913393136 0 

-1,97574478072792 0 

-1.96532146024999 0 

-1.93906624919753 0 

1.92853927147488 0 

-1.8799760260639 0 

1.86864966354898 0 

1.40471564488899 0 



decision limit 

363.95 



gene symbol 

QSCN6 

LGALS1 

APLP2 

CST3 

APLP2 

CD24 

APLP2 

MGC10812 

CD24 

APLP2 

CLN2 

MGC4175 

CD24 

HOXA9 

APLP2 
FLJ10357 



BNSDOCID: <WO 03039443A2 I > 



841 



WO 03/039443 
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AML WILL vs. ALL Ph 

accuracy 

confidence 

gene 

210487_at 

211404_s_at 

214875_x_at 

208702_x_at 

214651_s_at 

234107_s_at 

201105_at 

239647_at 

217979_at 

213150_at 

212012_at 

207971_s_at 

200742_s_at 

236606_jat 

235753_at 

232035_at 

41220_at 

20977 1_x_jat 

1007_s_at 

201968_s_at 



samples: 15/15 

1 

1 

signal-to-noise P 

-2.66400596339357 0 

2.5799328562955 0 

2.53091704146699 0 

2.442063176235 0 

2.38931027314895 0 

2.17019003130007 0* 

2.08770407459668 0 

2.02467198705529 0 

-1.99927745769533 0 

1.98436589229381 0 

-1.96026252932157 0 

-1.95442302553682 0 

1.93172870108821 0 

-1.90758120714248 0 

1.897686621 61 529- 0 

-1.87493486925109 0 

-1.84432301556426 0 

-1.83370834497243 0 
-1.8257200735147 0 
1.81818231032019 0 



decision limit 



321.6 



gene symbol 

DNTT 

APLP2 

APLP2 

APLP2 

HOXA9 

LGALS1 

NET-6 

HOXA10 

D2S448 

KIAA0582 

CLN2 



H4FH 

MSF 

CD24 

DDR1 

PGM1 



BNSDOCID: <WO 03039443A2_I_> 



842 



WO 03/039443 



PCT/EP02/12303 



AML WILL vs. T-ALL 

accuracy 

confidence 

gene 

242292_at 

243154_at 

219891_at 

211404_s_at 

209499_x_at 

211495_x_at 

208702_x_at 

214875_x_at 

2061 1 1_at 

209500_x_at 

210314_x_at 

225003_at 

218122_s_at 

204484_at 

205640_at 

201105_at 

200743_s_at 

222698_s_at 

200742_s_at 

201537 s at 



samples: 15/9 

1 

1 

signal-to-noise p 

-3.53231266693437 0* 

-3.1306973304726 0 

3.05552709838153 0 

2.9355923181049 0 

2.75677919960856 0 

2.73018317509301 0 

2.59886997802643 0 

2.52933264502392 0 

2.43695278026584 0 

2.41035664982945 0 

2.38921906328214 0 

2.37212829914098 0 

-2.30056189156697 0 

-2.28386643829897 0 

2.26383232065516 0 

2.26087708407154 0 

2.25956196871586 0 

2.23788003936299 0 

2.2220970534291 0 

2.16766347414866 0 



decision limit 

19.4 



gene symbol 



FLJ20208 

APLP2 

TNFSF13 

TNFSF13 

APLP2 

APLP2 

RNASE2 

TNFSF13 

TNFSF13 

MBC3205 

SENP2 

PIK3C2B 

ALDH3B1 

LGALS1 

CLN2 

IMPACT 

CLN2 

DUSP3 



843 



BNSDOCID: <WO__ 



_03039443A2_I_> 



WO 03/039443 
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ANIL MLL vs. AML +8 


samples: 15/10 








accuracy 


1 








confidence 


0.8689072208975 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


200056_s_at - HG-U133A 


-1 .60303643777462 


0 




C1D 


212250_at 


-1 .47262825515036 


0 






228024_at 


-1.43968630786794 


0 




PAK1 


205355_at 


-1.39534301157978 


0 




AC ADS B 


225700_at 


-1.39419450937928 


0.01 






234726_s_at 


1.36257922965513 


0 






20261 9_s_at 


-1.34323877280623 


0 




PLOD2 


205453_at 


-1.29520111839967 


0 




HOXB2 


202823_at 


-1.28255326311509 


0 




TCEB1 


210749_x_at 


-1.26893758929818 


0 




DDR1 


227786_at 


-1.24020773870069 


0 




TRAP25 


212479_s_at 


-1.23922427261637 


0* 


437.3 


FLJ 13910 


219312_s_at 


-1.21983738431424 


0 




RINZF 


218172_s_at 


-1.21403337241271 


0* 


285.95 


PR02577 


200867_at 


-1.2110165320696 


0* 


531 




202956_at 


-1.21043821718202 


0* 


550.6 


BIG1 


213902_at 


-1.19710457307816 


0 




ASAH 


239597_at 


1.19444081572455 


0 






214789_x_at 


-1.19367799101574 


0 




SRP46 


AFFX-r2-Ec-bioD-3_jat - 


HG- 








U133B 


0.789398697044362 


0 







844 



BNSDOCID: <WO 03039443A2J_> 
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AML WILL vs. AML complex 

accuracy 

confidence 

gene 

228083_at 

201105_at 

201377_at 

201358_s_at 

203387_s_at 

201585_s_at 

222982_x_at 

202746_at 

204951_at 

203725_at 

203544_s_at 

225804_at 

203386_at 

201359_at 

201830_s_at 

218041_x_at 

22331 8_s_at 

212222_at 

201829_at 

239647 at 



samples: 15/36 
1 

0.928537448772464 

signal-to-noise p 

1.83691594955677 0 

1.56013660814198 0 

-1.5390279619994 0 

-1.46121373620596 0 

-1.43200690176451 0 

-1.42612880048733 0 

-1.4232823410753 0 

-1.40541613814493 0* 

-1 .38290450200254 0* 

-1.35090921185734 0 

-1.33195316834084 0 

1.3109814684568 0 

-1.30903268579411 0 

-1.3066438308133 0 

-1.29170882160348 0 

-1 .27263528593652 0 

1.27042416454958 0 

-1.26786775770503 0 

-1.26235979300155 0 

1.09622576767215 0 



decision limit gene symbol 

LGALS1 

KIAA0144 

COPB 

KIAA0603 

SFPQ 

SLC38A2 



277.15 
224.05 



ARHH 

GADD45A 

STAM 

KIAA0603 

COPB 

NET1 

PRO1068 

MGC10974 

KIAA0077 

NET1 



BNSDOCID: <WO 03039443A2J_> 



845 



WO 03/039443 



PCT/EP02/12303 



AML WILL vs. AML normal 

accuracy 
confidence 
failed: 
gene 

205453_at 

222465_at 

225406_at 

200829_x_at 

225326_at 

200056_s_at - HG-U133B 

238856_s_at 

227786_at 

200673_at 

226250_at 

233559_s_at 

228904_at 

227680_at 

223982_s_at 

202377_at 

225700_at 

209160_at 

229232_at 

231870_s_at 

201105_at 



samples: 15/62 
0.948051948051948 
0.903617550418171 
3,4,72,76 

signal-to-noise p 

-1.38558116676561 0* 

-1.29236441992376 0 

-1.29036630213613 0* 

-1.1349148684836 0 

-1.12776286034489 0 

-1.12696320763032 0 

-1.09612740485765 0 

-1.08912698999125 0 

-1.08754065564232 0 

-1.08598948671591 0 

1.08564709559314 0 

-1.07283266172579 0 

-1.06344466244967 0 

-1.06119523760418 0 

-1.05377199670316 0 

-1.04818024157045 0 

-1 .04660040264575 0 

-1.04476453216419 0 

-1.04348230442961 0 

1.03632090944134 0 



decision limit gene symbol 

203.25 HOXB2 
C15orf15 

191.3 TSG 

ZNF207 

KIAA1311 

C1D 

TRAP25 
LAPTM4A 

FENS-1 



IPLA2 

HSOBRGRP 

AKR1C3 

LOC51068 
LGALS1 



BNSDOCID: <WO 03039443A2J_> 



846 



WO 03/(139443 



PCT/EP02/12303 



ami Rill 1 \/c AMI tffl-9il 
AML WILL VS. MIVIL HO,a l; 


samDles* 15/13 




accuracy 


1 
i 




confidence 


1 




gene 


<^ianal-to-noise 


p 


214651_s — at 


2 421 14497347366 


0 


201105_at 


2 17374439391796 


0* 


228827_ at 


-9 03460798747208 


0 


206009_at 


-9 02655766969028 


0 


228083_at 


1 QQ7682447 10951 


0 


213150_at 




0 


5022 1_at 




0 


221581_s_at 




0 


235753_at 




0 


206622_at 


1 70^0162^26109 


0 


209905_at 


-1 7R71 ftQ7 1064408 


0 


204069_at 




0 


209160_at 


-1.75500319419551 


0 


23581 8_at 


-1.70111545046162 


0 


223498_at 


1.67798456165549 - 


0 


2H 404j3_at 


1.67488607654784 


0 


209500_x_at 


1.66210516483391 


0 


203949_at 


-1.65967693892027 


0 


214875_x_at 


1 .65852470588382 


0 


56256_at 


1.09757984221605 


0 



decision limit gene symbol 
HOXA9 

3881.25 LGALS1 

ITGA9 

HOXA10 

WBSCR5 

TRH 
HOXA9 
MEIS1 
AKR1C3 



APLP2 

TNFSF13 

MPO 

APLP2 

LOC51092 



BNSDOCID: <WO 03039443A2_I_> 



847 



WO 03/039443 



PCT/EP02/12303 



CLL vs. all other 

accuracy 
confidence 
failed: 
gene 

224838_at 

239287_at 

22351 4_at 

44790_s_at 

212590_jat 

223287_s_at 

228390_at 

219471_at 

208456_s_at 

243780_at 

201998_at 

223391_at 

208091 _s_at 

225927_at 

202589_at 

229072_at 

229844_at 

21461 5_at 

230768_at 

224837_at 



samples: 32 / 237 

0.996282527881041 

1 

11 

signal-to-noise P 

2.72658457901213 0 

2.21787255431445 0* 

2.1294564890461 0 

2.04322734300157 0 

2.0395652222237 0 

2.01927297186102 0 

2.01736536531344 0 

2.0123024445387 0 

2.00649246957273 0 

1.99741020732395 0 

1.97818562004593 0 

1.974^9784627482 0 

1.95442592300049 0 

1.94521013683932 0 

-1.94332001129586 0 

1.94278755168998 0 

1.93701311168151 0 

1.91900370555866 0 

1.89700621262016 0 

1.88521474574714 0 



decision limit 

630.55 



gene symbol 



CARD11 
FLJ21562 

FOXP1 

FLJ21562 
RRAS2 

SIAT1 

LOC81537 

DKFZP564K0822 

MAP3K1 

TYMS 



P2Y10 



FOXP1 



BNSDOCID: <WO 03039443A2J_> 



848 



WO 03/039443 



PCT/EP02/12303 



CLL vs. CML 

accuracy 

confidence 

gene 

210254_at 

205557_at 

206111_at 

20961 9_at 

202503_s_at 

206871_at 

203949_at 

202589_at 

212268_at 

212531_at 

206676_at 

204670_x_at 

210334_x_at 

200654_at 

208306_x_at 

224838_at 

213572_s_at 

207269_at 

212750_at 

41577 at 



samples: 32/14 

1 

1 

signal-to-noise P 

-4.88339464327865 0* 

-4.67296996074416 0 

-4.55904506479679 0 

4.54717160557061 0 

-4.46030015155587 0 

-4.43465869214911 0 

-4.36878588501229 0 

-4.25959896335016 0 

-4.11824280019806 0 

-4.10884128760506 0 

-3.98482491918087 0 

3.97453356450998 0 

-3.95114669210453 0 

-3.73551197850169 0 

3.6021607716515 0 

3.57587529198941 0 

-3.56773642902922 0 

-3.5389139132598 0 

3.51249429987917 0 

3.13666804152596 0 



decision limit 

3718.45 



gene symbol 
BPI 

RNASE2 

CD74 

KIAA0101 

ELA2 

MPO 

TYMS 

SERPINB1 

LCN2 

CEACAM8 

HLA-DRB5 

BIRC5 

P4HB 

HLA-DRB4 

SERPINB1 
DEFA4 
PPP1R16B 
PPP1R16B 



849 



BNSDOCID: <WO_ 



_03039443A2_I 



WO 03/039443 



PCT/EP02/12303 



CLL vs. normal BWI 

accuracy 

confidence 

gene 

204776_at 

210613_s_at 

218662_s_at 

202503_s_at 

202580_x_at 

205051_s_at 

20201 8_s_at 

230988_at 

206871_at 

224975_at 

227230_s_at 

20971 4_s_at 

212531_at 

214575_s_at 

202589_at 

223785_at 

202705_at 

224976_at 

210052 s at 



samples: 32 / 9 

1 

1 

signal-to-noise p 

-4.80025506569938 0* 

-4.76199342875923 0 

-4.49242317171143 0 

-4.2272639947421 0 

-4.18939678293477 0 

-4.15661524563327 0 

-4.09501572722927 0 

-4.0486666186669 0 

-4.04477996703664 0 

-3.78270966714705 0 

-3.69195069014622 0 

-3.55884052493629 0 

-3.54256621978711 0 

-3.53838586077798 0 

-3.38277320357567 0 

-3.37348567040386 0 

-3.36111245088024 0 

-3.35971882977735 0 

-3.33990497740824 0 



decision limit 

163.3 



gene symbol 

THBS4 

SYNGR1 

HCAP-G 

KIAA0101 

FOXM1 

KIT 

LTF 

ELA2 

NFIA 

KIAA1211 

CDKN3 

LCN2 

A2U1 

TYMS 

FLJ10719 

CCNB2 

NFIA 

C20orf1 



BNSDOCID: <WO 03039443A2 J > 



850 



WO 03/039443 



•I 

PCT/EP02/12303 



CLL vs. ALLt(8;14) 


samples: 32/4 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


202580_x_at 


-7.18159480292423 


0* 


218.2 


FOXM1 


209891_at 


-6.98895220029526 


0 




AD024 


227039_at 


3.70691051731571 


0 




AKAP13 


218663_at 


-3.68808195572391 


0 




HCAP-G 


36004_at 


3.51464925040429 


0 




1KB KG 


226936_at 


-3.44796352148406 


0 






22821 1_at 


3.34271829518438 


0 






218355_at 


-3.32490989672808 


0 




KIF4A 


201761_at 


-3.28097759120759 


0 




MTHFD2 


205837_s_at 


-3.18161701452463 


0 




GYPA 


210052_s_at 


-3.16637659493241 


0 




C20orf1 


202095_s_jat 


-3.11748537316183 


0 




BIRC5 


202503_s__at 


-3.09625623775405 


0 




KIAA0101 


212020_s_at 


-3.09152886109552 


0 




MKI67 


216833_x_at 


-3.08262837314566 


0 






228361_at 


-3.05435230901633 


0 






212022_s_at 


-3.01225907956052 


0 




MKI67 


227065_at 


3.00267996592052 


0 




COL5A2 


218039_at 


-2.9948995053224 


0 




ANKT 


36920_at 


2.10655917810938 


0 




MTM1 



BNSDOCID: <WO 03039443A2_I_> 



851 



WO 03/039443 



PCT/EP02/12303 



CLL vs. ALL B not Ph 

accuracy 

confidence 

gene 

225927_at 

213927_at 

201462_at 

224838_at 

213453_x_at 

239287_at 

208091 _s_at 

201998_at 

224848_at 

202863_at 

225592_at 

205805_s_at 

228390_at 

AFFX- 

HUMGAPDH/M33197_3_at 

HG-U133B 

220768_s_at 

221858_at 

214786_at 

224570_s_at 

202625_at 

AFFX-r2-Hs1 8SrRNA-3_s_at 
HG-U133B 



samples: 32 / 9 

1 

1 

signal-to-noise p 

3.15130564055876 0* 

2.64985304902657 0 

2.62143810029708 0 

2.5941283416844 0 

-2.18846578413759 0 

2.18590081196116 0 

2.12019333022318 0 

2.1165316341122 0 

-2.11049474415864 0 

2.08665988043685 0 

-2.07634399767076 0 

2.04616469387812 0 

2.03697588799482 0 



-2.02578116307399 0 

2.01657972292803 0 

2.00702836396881 0 

2.00594347120247 0 

-2.00161337456916 0 

1.99700437780915 0 

0.670536484443349 0 



decision limit 

1541.45 



gene symbol 
MAP3K1 

KIAA0193 

GAPD 

DKFZP564K0822 
SIAT.1 

SP100 

NRM 

ROR1 



GAPD 
CSNK1G3 
KIAA0608 
MAP3K1 

LYN 



852 



BNSDOCID: <WO_ 



_03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



CLL vs. ALL Ph 

accuracy 

confidence 

gene 

204663_at 

203373_at 

225927_at 

22351 4_at 

210487_at 

213927_at 

237337_at 

236280_at 

227900_at 

211709_s_at 

224833_at 

226545_at 

208091_s_at 

235674_at 

234107_s_at 

223462_at 

201462_at 

223391_at 

212719 at 



samples: 32/15 

1 

1 

signal-to-noise p 

-3.923285216392 0* 

-3.78247945374324 0 

3.05858766321181 0 

2.79698500849851 0 

-2.66400596339357 0 

2.64985304902657 0 

2.52705433891484 0 

2.49746424904494 0 

2.49040043479395 0 

-2.45092080989795 0 

2.40705165058966 0 

-2.40457551567713 0 

2.38274758846201 0 

2.37963455734461 0 

2.36302665773918 0 

2.32337125093609 0 

2.3212369641096 0 

2.30224083750022 0 

-2.29194173393375 0 



decision limit 

80.55 



gene symbol 

ME3 
STATI2 
MAP3K1 
CARD 11 
DNTT 



SCGF 
ETS1 

DKFZP564K0822 



MGC4618 
KIAA0193 
LOC81537 
SCOP 



853 



BNSDOCID: <WO_ 



_03039443A2J_> 



WO 03/039443 



PCT/EP02/12303 



CLL vs. T-ALL 

accuracy 

confidence 

gene 

201417_at 

201416_at 

221858_at 

213772_s_at 

224932_at 

224847_at 

201462_at 

224848_at 

243780_at 

227900_at 

202625_at 

228390_at 

201998_at 

226545_at 

204446_s_at 

204192_at 

219076_s_at 

206398_s_at 

224838_at 

212827 at 



samples: 32 / 9 

1 

1 

signal-to-noise p 

-3.83747798152129 0* 

-3.69716215884899 0 

2.73818494452306 0 

2.72201152228286 0 

2.6366912338985 0 

-2.63602874545103 0 

2.62143810029708 0 

-2.60776696130397 0 

2.49991585531364 0 

2.49040043479395 0 

2.4377901797746 0 

2.41659743191322 0 

2.3963Q335572289 0 

-2.3815195487075 0 

2.37716091038765 0 

2.34749349754959 0 

-2.31227318629435 0 

2.30227312148638 0 

2.29208011575118 0 

2.28758794629918 0 



decision limit 

2230.95 



gene symbol 

SOX4 
KIAA0608 
GGA2 
PRSS2 

KIAA0193 



LYN 

SIAT1 

ALOX5 
CD37 
PXMP2 
CD19 

IGHM 



BMSOOCID: <WO Q3039443A2J_> 



854 



WO 03/039443 



PCT/EP02/12303 



/-*! | A Kill J.Q 

CLL VS. AMU +o 


samples, / iu 






accuracy 


i 






confidence 


1 






gene 


signal-to-noise 


P 


Hopicion limit /iano cumhrkl 
Ucuioiuii mint ycl it; oyiiiuwi 


224838_at 


o.1byoo4U4o/^l To 


u 




22351 4_at 


2. /9oyoo00o4yoo1 


u 


PARm 1 


212827_at 


2.0b/ bo/ /oUUUoo 


n 
u 


i V3 n i vi 


201200_at 


-2.52QooUQoo4u4o 


U 




242633_x_at 


2, ooOoo/o / 4oo1 b / 


n 
u 




208456_s_at 


2.341 06774724323 


U 


DDAQO 


212590_at 


2.3295599530646 


0 




44790_s_at 


2.3285406901 4676 


0 


CI IOi KCO 

rLJ2lob2 


219471_at 


2.27189811261511 


0 


rl_J21 DoZ 


202441 _at 


-2 . 269 1 54 ( 5899 159 


0 


l\tU4 


209374_s_at 


2.263527554631 27 


U 


Ivanlvl 


243780_at 


2.25989352775893 


0 




20421 5_at 


2.24311505389492 


0* 


906.65 MGC4175 


223382_s_at 


-2.21844659554677 


0 


NIN283 


201163_s_at 


-2.2061831531108 


0 


IGFBP7 


224837_at 


2.17714293481922 


0 


FOXP1 


223391_at 


2.1733535915887 


0 


LOC81537 


20809 1_s_at 


2.17150813050761 


0 


DKFZP564K0822 


205933_at 


2.16573909337041 


0 


SETBP1 


239287_at 


2.1598880074389 


0 





855 



BNSDOCID: <WO 03039443A2J_> 



WO 1)3/039443 



PCT/EP02/12303 



CLL vs. AML complex 

accuracy 

confidence 

gene 

224838_at 

235674_at 

243780_at 

239287_at 

223514_at 

222680_s_at 

20961 9_at 

202503_s_at 

208456_s_at 

208864_s_at 

212590_at 

201200_at 

212827_at 

224837_at 

205805_s_at 

244261_at 

223287_s_at 

208091 _s_at 

226989_at 

236854 at 



samples: 32 / 36 

1 

1 

signal-to-noise p 

2.78969388721613 0 

2.37963455734461 0 

2.32051881321218 0 

2.24074490580899 0 

2.18755403591948 0 

-2.18354473266858 0* 

2.13206191976271 0 

-2.11188972263141 0 

2.10620770802115 0 

-2.10162761752294 0 

2.09310293747601 0 

-2.05492899324696 0 

2.054^2625446282 0 

2.04951179193956 0 

2.04616469387812 0 

2.03144018714413 0 

2.0252763158281 0 

2.0139098675012 0 

2.00631003964875 0 

1.99292097643003 0 



decision limit gene symbol 



203.95 



CARD1 1 

RAMP 

CD74 

KIAA0101 

RRAS2 

TXN 

CREG 
IGHM 
FOXP1 
ROR1 

FOXP1 

DKFZP564K0822 



856 



BNSDOCID: <WO_ 



_03039443A2_I_> 



WO 03/039443 



PCT/EP02/ 12303 



CLL vs. AML normal 

accuracy 
confidence 
failed: 
gene 

224838_at 

243780_at 

22351 4_at 

201200_at 

201998_at 

44790_s_at 

205049_s_at 

239287_at 

223287_s_at 

223391_at 

219471_at 

225175_s_at 

20421 5_at 

236280_at 

208456_s_at 

206398_s_at 

212827_at 

228390_at 

229072_at 

214615 at 



samples: 32 / 62 

0.98936170212766 

1 

11 

signal-to-noise p 

2.80936017892095 0* 

2.42900650257202 0 

2.40431982389629 0 

-2.28363968538296 0 

2.19987941765164 0 

2.18842053042545 0 

2.18662155440889 0 

2.17948863877979 0 

2.16628667951355 0 

2.16298199717166 0 

2.13618332752253 0 

2.12811708839772 0 

2.11642877909845 0 

2.11271003733104 0 

2.10890038745727 0 

2.09984081904119 0 

2.09850222830603 0 

2.09600254213173 0 

2.09351406943008 0 
2.06329249104139 ' 0 



decision limit 

2987.7 



gene symbol 



CARD11 

CREG 

SIAT1 

FLJ21562 

CD79A 

FOXP1 

LOC81537 

FLJ21562 

CTL2 

MGC4175 

RRAS2 

CD19 

IGHM 



P2Y10 



857 



BNSDOCID: <WO_ 



_03039443A2J_> 



WO 03/039443 



PCT/EP02/12303 



CLL vs. AML t(8;21) 

accuracy 

confidence 

gene 

203949_at 

224838_at 

205192_at 

211709_s_at 

212827_at 

230768_at 

206009_at 

208091 _s_at 

208456_s_at 

215215_s_at 

203948_s_at 

201811_x_at 

243780_at 

212590_at 

227900_at 

226546_at 

225927_at 

212589_at 

228390_at 

225246 at 



samples: 32/13 

1 

1 

signal-to-noise p 

-3.50330789770673 0 

3.25663434332492 0 

3.21758439151756 0* 

-2.95109939287116 0 

2.84469338495376 0 

2.8154401362934 0 

-2.7972720292112 0 

2.63085873400942 0 

2.62810675193075 0 

-2.60039836996497 0 

-2.56960807932381 0 

2.56895859687937 0 

2.51082778037667 0 

2.50058834205242 0 

2.49040043479395 0 

-2.44432895386159 0 

2.44085004959256 0 

2.43561411833392 0 

2.41659743191322 0 

2.39661265710521 0 



decision limit gene symbol 
MPO 



113.25 



MAP3K14 

SCGF 

IGHM 

ITGA9 

DKFZP564K0822 
RRAS2 

MPO 
SH3BP5 



MAP3K1 
RRAS2 

STIM2 



858 



BNSDOCID: <WO_ 



_03O39443A2_l_> 



WO 03/039443 



PCT/EP02/12303 



CML vs. all other 

accuracy 
confidence 
failed: 
gene 

212531_at 

205557_at 

206676_at 

209772_s_at 

207802_at 

216379_x_at 

209771_x_at 

203936_s_at 

211657_at 

203757_s_at 

210254_jat 

20201 8_s_at 

211275_s_at 

20551 3_at 

205863_at 

223839_s_at 

210244_at 

206440_at 

207269_at 

204174 at 



samples: 14/255 
0.981412639405205 
0.984251851969928 
4,6,10,14,115 

signal-to-noise p 

2.20686225450673 0 

2.19817622063178 0* 

2.01751329912518 0 

1.89825970350824 0* 

1.84335905687648 0 

1.76253321043545 0 

1.75699226119665 0 

1.75554347155831 0 

1.69149085236384 0 

1.6780512743379 0 

1.6590320804534 0 

1.64680728641866 0 

1.62477051547664 0 

1.60415014432038 0 

1.60339454990244 0 

1.57424844312026 0 

1.56058501019954 0 

1.5590538838617 0 

1.55083422089021 0 
1.53747851176966 1 0 



decision limit gene symbol 
LCN2 

11581.65 BPI 

CEACAM8 

3429.4 CD24 
SGP28 

CD24 
MMP9 

CEACAM6 

LTF 
GYG 
TCN1 
S100A12 

CAMP 
VELI1 
DEFA4 
ALOX5AP 



BMSDOCID: <WO 03039443A2_I_> 



859 



WO 03/039443 PCT/EP02/1 2303 



CML vs. normal BM 

accuracy 

confidence 

gene 

227198_at 

204562_at 

227749_at 

228377_at 

22671 3_at 

20961 9_at 

227375_at 

204057_at 

201506_at 

221558_s_at 

227867_at 

225331_at 

223280_x_jat 

243780_at 

228055_at 

204670_x_at 

208683_at 

210356_x_at 

209670_at 

AFFX- 

HUMGAPDH/M331 97_3_at 
HG-U133B 



samples: 14/9 

1 

1 

signal-to-noise 

-3.96995770729437 

-3.83910818190111 

-3.11980636745679 

-2.65145492785893 

-2.28229285614419 

-2.27384420268273 

-2.22184735546056 

-2.13119125485664 

-2.05058862618543 

-2.0055054625083 

-2.00414646646454 

-1.95134035825926 

-1.95125537387335 

-1.94601159348593 

-1.94418493221925 

-1.94197079510435 

-1.93662356244796 

-1.92014292171279 

-1.89146017871026 



1.0887744230798 



P 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit 

39.95 



gene symbol 
IRF4 

KIAA1384 
CD74 

DKFZP566D1346 

ICSBP1 

TGFBI 

LEF1 

MS4A6A 



HLA-DRB5 
CAPN2 
MS4A2 
TRA 



GAPD 



BNSDOCID: <WO 03039443A2 J_> 



860 



WO 03/039443 



PCT/EP02/12303 



CMLvs. ALL t(8;14) 

accuracy 

confidence 

gene 

231215_at 

212400_at 

239835_at 

214439_x_at 

227173_s_at 

20961 9_at 

234660_s_at 

202723_s_at 

227700_x_at 

219202_at 

204638_at 

222147_s_at 

213772_s_at 

203932_at 

205145_s_at 

221969_at 

217521_at 

205541 _s_at 

204661 _at 

242729 at 



samples: 14/4 

1 

1 

signal-to-noise p 

-3.69904177219169 0* 

-3.60679086659289 0 

-3.24273732421016 0 

-3.01379256143662 0 

-2.97620214103741 0 

-2.86528990843459 0 

-2.81161728678359 0 

-2.69572816449622 0 

-2.59859831428636 0 

-2.52442131777509 0 

-2.51835721417786 0 

-2.47042047428454 0 

-2.41 552637854988 0 

-2.39804753895984 0 

-2.34102992713809 0 

-2.278395672233 0.01 

2.26710710056045 0 

-2.22303735713359 0 

-2.15814023268535 0 

2.02602724250151 0 



decision limit 

17.25 



gene symbol 



KIAA1842 

BIN1 

BACH2 

CD74 

DIS3 

FOXOIA 

FLJ 10709 

FLJ22341 

ACP5 

GGA2 
HLA-DMB 
MYL5 
PAX5 

GSPT2 
CDW52 



BNSDOCID: <WO 03039443A2_I_> 



861 



WO 03/039443 



PCT/EP02/1 2303 



CML vs. ALL B not Ph 

accuracy 

confidence 

gene 

206440_at 

223839_s_at 

201482_at 

223246_s_at 

202477_s_at 

22741 5_at 

20281 9_s_at 

226876_at 

200654_at 

231472_at 

217521_at 

218764_at 

219615_s_at 

219111_s_at 

212268_at 

211275_s_at 

211990_at 

206676_at 

23581 8_at 

210254 at 



samples: 14/9 

1 

1 

signal-to-noise p 

3.54851691296324 0* 

2.95162251991881 0 

2.5967465282815 0 

-2.53289104519377 0 

-2.36949525800043 0 

-2.35663515228965 0 

-2.31208048847973 0 

2.29929647597952 0 

2.2916645921996 0 

-2.27133126588153 0 

2.26710710056045 0 

-2.26149206068996 0 

2.21717677908473 0 

-2.20588673666352 0 

2.20005764504112 0 

2.1595066613388 0 

-2.15004013194548 0 

2.13116752502457 0 

2.13111499669481 0 

2.07581294141343 0 



decision limit gene symbol 
69.75 VELI1 

QSCN6 
STRBP 
GCP2 

TCEB3 

P4HB 



MGC5363 

KCNK5 

MGC2835 

SERPINB1 

GYG 

HLA-DPA1 
CEACAM8 



BNSDOCID: <WO 03039443A2J_> 



862 



WO 03/039443 PCT/EP02/1 2303 



CML vs. ALL Ph 

accuracy 

confidence 

gene 

20961 9_at 

212778_at 

201029_s_at 

214317_x_at 

202332_at 

211990_at 

203373_at 

210487_at 

226844_at 

207971_s_at 

206440_at 

20551 3_at 

211275_s_at 

204670_x_at 

205557_at 

220000_at 

212531_at 

204174_at 

225386_s_at 

206111_at 



samples: 14/15 

1 

1 

signal-to-noise p 

-3.47132181247921 0* 

-3.09966497846595 0 

-3.08460906542498 0 

-3.02953000176234 0 

-2.9539112535883 0 

-2.80086525597245 0 

-2.76056827822619 0 

-2.66400596339357 0 

-2.53578669067924 0 

-2.4877090389281 0 

2.45272667930868 0 

2.38968722893052 0 

2.33552678870234 0 

-2.32905695313267 0 

2.3286749741246 0 

2.32132983989275 0 

2.31169568945083 0 

2.3066969794321 0 

2.25229241478444 0 

2.22277368410809 0 



decision limit 

5777.85 



gene symbol 

CD74 

KIAA0602 

MIC2 

RPS9 

CSNK1E 

HLA-DPA1 

STATI2 

DNTT 

KIAA0582 
VELI1 
TCN1 
GYG 

HLA-DRB5 
BP! 

SIGLEC5 

LCN2 

ALOX5AP 

LOC92906 

RNASE2 



BNSDOCID: <WO 03039443A2_L> 



863 



WO 03/039443 PCT/EP02/1 2303 



CML vs. T-ALL 

accuracy 

confidence 

gene 

210254_at 

206111_at 

205557_at 

216379_x_at 

209771_x_at 

204484_at 

203949_at 

206871_at 

212268_at 

239071_at 

209772_s_at 

212531_at 

206440_at 

242292_at 

206676_at 

218224_at 

211657_at 

231215_at 

211275_s_at 

203757 s at 



samples: 14/9 

1 

1 

signal-to-noise p 

4.72390509810889 0* 

4.12411957901486 0 

4.12268252311326 0 

4.09576684675689 0 

4.04268115620178 0 

-3.92079658637056 0 

3.82080923209307 0 

3.79666500221894 0 

3.66762573362107 0 

-3.65785660037409 0 

3.61720812003475 0 

3.55056769994448 0 

3.54851691296324 0 

-3.53231266693437 0 

3.51041964854693 0 

-3.28568428621792 0 

3.27814094785335 0 

-3.27770330337019 0 

3.26113496045954 0 

3.16440848129156 0 



decision limit 

3562.85 



gene symbol 

RNASE2 
BPI 

CD24 

PIK3C2B 

MPO 

ELA2 

SERPINB1 

CD24 
LCN2 
VELI1 

CEACAM8 
PNMA1 



GYG 

CEACAM6 



BNSDOCID: <WO 03039443A2_I_> 



864 



WO 03/039443 



PCT/EP02/12303 



CML vs. AML +8 

accuracy 

confidence 

gene 

212531_at 

218151_x_at 

203188_at 

22501 4_at 

223894_s_at 

227999_at 

210244_at 

223226_x_at 

205849_s_at 

203936_s_at 

202080_s_at 

212146_at 

210749_x_at 

220755_s_at 

211743_s_at 

214317_x_at 

2l4475_x_at 

200094_s_at - HG-U133A 

223939_at 

229934 at 



samples: 14/10 

1 

1 

signal-to-noise p 

3.91758233758871 0* 

-3.61342068798238 0 

-2.49453997369644 0 

-2.46286744087574 0 

2.42404553998791 0 

-2.41422896946156 0 

2.30650676975445 0 

-2.1556086960975 0 

-2.15471806280279 0 

2.12656987689311 0 

-2.08086044698338 0 

-2.00720003739628 0 

-2.00702022115248 0 

-1.97451125809376 0 

1.88718047009153 0 

-1.86316203101213 0 

1.8601686492104 0 

-1.83127854302456 0 

1.81572736683912 0 

1.65483415989943 0 



decision limit 

6654.5 



gene symbol 
LCN2 
FLJ11856 
B3GNT6 

FTS 

LOC1 70394 

CAMP 

MGC3181 

UQCRB 

MMP9 

KIAA1042 

KIAA0842 

DDR1 

PRG2 

RPS9 

CAPN3 

EEF2 

GPR91 



865 



BNSDOCID: <WO_ 



_03039443A2J_> 



WO 03/039443 



PCT/EP02/12303 



CML vs. AML complex 

accuracy 
confidence 
failed: 
gene 

209772_s_at 

212531_at 

210244_at 

205557_at 

207802_at 

203936_s_at 

206676_at 

216379_x_at 

209771_x_at 

202265_at 

209186_at 

201029_s_at 

206440_at 

218144_s_at 

212783_at 

225065_x_at 

214317_x_at 

212232_at 

223894_s_at 

207269 at 



samples: 14/36 

0.98 

1 

19 

signal-to-noise p 

2.25515756155772 0 

2.16780856655461 0* 

2.05958595591104 0 

1.88114022725824 0 

1.84224616530014 0 

1.83245146841284 0 

1.82688412686061 0 

1.82175067686641 0 

1.71430561727442 0 

-1.71011447772371 0 

-1.65481819707492 0 

-1.6407978740451 0 

1.62023824679434 0 

-1.60220955278722 0 

-1.60173956589805 0 

1.5634162479603 0 

-1.51909514839257 0 

-1.51499149327941 0 

1.50378092313755 0 

1.49292374711871 0 



decision limit 

10028.8 



gene symbol 

CD24 

LCN2 

CAMP 

BPI 

SGP28 
MMP9 
CEACAM8 

CD24 

BMI1 

ATP2A2 

MIC2 

VELI1 

FLJ22056 

DKFZp761B2423 

RPS9 
KIAA1014 
FTS 
DEFA4 



866 



BNSDOCID: <WO_ 



_03039443A2J_> 



WO 03/039443 



PCT/EP02/12303 



CML vs. AML normal 

accuracy 

confidence 

gene 

212531_at 

209772_s_at 

207802_at 

205557_at 

206676_at 

20551 3_at 

209771_x_at 

216379_x_at 

211657_at 

203757_s_at 

20201 8_s_at 

210254_at 

203021_at 

205863_at 

203936_s_at 

210244_at 

204351_at 

211275_s_at 

224967_at 

223894 s at 



samples: 14/62 

1 

1 

signal-to-noise p 

3.52937454592387 0* 

2.62224687740926 0 

2.60464949585023 0 

2.59981269961356 0 

2.51996784029752 0 

2.39713364221477 0 

2.3694290541236 0 

2.36313392615064 0 

2.35374478763084 0 

2.29281876629007 0 

2.19711362051377 0 

2.16927497946765 0 

2.12970551783084 0 

2.10812748771569 0 

2.07272259731953 0 

2.03624546792529 0 

2.0071543551955 0 

1.97937290056753 0 

1.96229148265436 0 

1.92953251923329 0 



decision limit 

9007.35 



gene symbol 

LCN2 

CD24 

SGP28 

BPI 

CEACAM8 

TCN1 

CD24 



CEACAM6 
LTF 

SLPI 

S100A12 

MMP9 

CAMP 

S100P 

GYG 



FTS 



BNSDOCID: <WO 03039443A2J_> 



867 



WO 03/039443 



PCT/EP02/ 12303 



fcMLvs. AML t(8;21) 

accuracy 

confidence 

gene 

207802_at 

201425_jat 

201029_s_at 

205653_at 

202391_at 

20961 9_at 

228827_at 

204670_x_at 

212531_at 

203936_s_at 

225386_s_at 

226878_at 

201015_s_at 

201281_at 

210982_s_at 

210254_at 

201 1 37_s_at 

202545_at 

228485_s_at 

231969 at 



samples: 14/13 

1 

1 

signal-to-noise p 

2.83774385104531 0 

2.50825834074572 0 

-2.22223571455337 0* 

2.20106889451589 0 

2.10168460148023 0 

-2.0547003400447 0 

-2.03460798747208 0 

-1.98893997185408 0 

1.96994267101106 0 

1.92996861752981 0 

1.90963950502737 0 

-1.90647418854412 0 

-1.90408812555782 0 

-1.88812152152812 0 

-1.87330616648773 0 

1.87025014628174 0 

-1.84763478416102 0 

1.81501795739179 0 

1.81223837093458 0 

1.61927333154015 0 



decision limit 



3078.2 



gene symbol 

SGP28 

ALDH2 

M1C2 

CTSG 

BASP1 

CD74 

HLA-DRB5 
LCN2 
MMP9 
LOC92906 

JUP 

ADRM1 

HLA-DRA 

HLA-DPB1 

PRKCD 

CDW92 

DKFZp762K222 



BMSDOCID: <WO 03039443A2_I_: 



868 



WO 03/039443 



PCT/EP02/12303 



normal BM vs. all other 


samples: 9 / 260 








accuracy 


0. 970260223048327 








confidence 


1 








failed: 


1,3,4,5,7,8,9,234 








gene 


signal-to-noise 


P 


• _ • • j _ ft ■ * * 

decision limit 


gene symt 


20201 8_s_at 


1 .7001 4923965644 


0 




LTF 


231241_at 


1 .68875870396623 


0 






212531_at 


1.61671894194992 


0 




LCN2 


210244_at 


1.4725412229519 


0 




CAMP 


244652_at 


1 .41 007652726897 


0 






218916_at 


-1 .40959768522889 


0 




FLJ23436 


208141_s_at 


-1 .39496085050899 


0 




MGC4293 


218516_s_at 


-1 .39489208362807 


0 




FLJ20421 


224976_at 


1 .38035857849962 


0 




NFIA 


201396_s_at 


-1.35501592670165 


0 




SGT 


230988_at 


1.27806474591682 


0 






200631_s_at 


-1.27257367942543 


0 




SET 


224975_at 


1.25475272090295 


0 




NFIA 


203535_at 


1.24698765220296 


0 




S100A9 


227497_at 


1.24053728574653 


0* 


317.95 




225792_at 


1.2371690464102 


0 






203367_at 


-1.23177778244874 


0 




DUSP14 


40189_at 


-1.22367056036721 


0 




SET 


225700_at 


1.22315772266762 


0 






223785_at 


1.22097206282821 


' 0 




FLJ10719 



BNSDOCID: <WO 03039443A2J_> 



869 



WO 03/039443 



PCT/EP02/12303 



normal BM vs. ALL t(8;14) 

accuracy 

confidence 

gene 

222147_s_at 

209380_s_at 

230988_at 

227230_s_at 

201278_at 

205051_s_at 

214558_at 

201988_s_at 

223253_at 

204285_s_at 

237006_at 

217047_s_jat 

243020_at 

205690_s_at 

234764_x_at 

209160_at 

239835_at 

242434_at 

228708_at 

229575 at 



samples: 9 / 4 

1 

1 

signal-to-noise p 

-5.80632286389653 0* 

4.30105342509009 0 

4.0486666186669 0 

3.69195069014622 0 

3.59730622800604 0 

3.55673529468534 0 

3.49255759470258 0 

3.46251302734757 0 

3.45322097101822 0 

-3.35570555370453 0 

3.14219078440997 0 

3.00448756016945 0 

2.97861252444615 0 

-2.97763156027721 0 

2.9597816169315 0 

2.88288854646291 0.01 

-2.82125904680112 0 

-2.60351 76441 0894 0.01 

2.60104548908002 0.01 

2.59826531472399 0 



decision limit 

48.2 



gene symbol 

ABCC5 

KIAA1211 

DAB2 

KIT 

GPR12 
CREBL2 
UCC1 
PMAIP1 



G10 

AKR1C3 
KIAA1842 



BNSDOCID: <WO 03039443A2J_> 



870 



WO 03/039443 



PCT/EP02/12303 



normal BM vs. ALL B not Ph 

accuracy 

confidence 

gene 

201506_at 

218718_at 

205051 _s_at 

23581 8_at 

202382_s_at 

221802_s_at 

238066_at 

203645_s_at 

227038_at 

225792_at 

202729_s_at 

230441_at 

212989_at 

201462_at 

227627_at 

204112_s_at 

204285_s_at 

223044_at 

223839_s_at 

204225 at 



samples: 9/9 
1 

1 , 

signal-to-noise p 

3.72288560994884 0* 

3.24294310152425 0 

3.03451031560789 0 

2.5546546541523 0 

2.39296453276424 0 

2.32414568373485 0 

2.17647785634866 0 

2.13814112768782 0 

2.09057615422001 0 

2.07680085776^85 0 

2.04654552970378 0 

-2.03717805375485 0 

2.00592033097705 0 

1.98840399134665 0 

1.97570787395888 0 

1.90974025156521 0 

-1.90604496284052 0 

1.88064158446644 0 

1.86406616667336 0 

1.85888952621093 0 



decision limit 

464.35 



gene symbol 
TGFBI 
PDGFC 
KIT 

GNPI 
KIAA1598 
RBP7 
CD163 



LTBP1 



KIAA0193 

SGKL 

HNMT 

PMAIP1 

SLC11A3 

HDAC4 



871 



BNSDOCID: <WO__ 



_03039443A2_I 



WO 03/039443 



PCT/EP02/12303 



normal BM vs. ALL Ph 

accuracy 

confidence 

gene 

225792_at 

224976_at 

227230_s_at 

203373_at 

202332_at 

212778_at 

230988_at 

218718_at 

205624_at 

224975_at 

234107_s_at 

217988_at 

202945_at 

218916_at 

223044_at 

226751_at 

210487_at 

201029_s_at 

207971_s_at 

203645 s at 



samples: 9/15 

1 

1 

signal-to-noise p 

5.04037264761662 0* 

4.534006778733 0 

3.69195069014622 0 

-3.60116056485263 0 

-3.19705046805115 0 

-3.09966497846595 0 

3.03359562676407 0 

3.0129378014466 0 

2.79751505147013 0 

2.78377945957649 0 

2.7482082597836 0 

-2.7439427550045 0 

-2.67000372671022 0 

-2.6636521234692 0 

2.59882500535427 0 

2.41273912703748 0 

-2.38376037071424 0 

-2.37712949687212 0 

-2.36738543730614 0 

2.32470988727879 0 



decision limit 

76.45 



gene symbol 

NFIA 

KIAA1211 

STATI2 

CSNK1E 

KIAA0602 

PDGFC 

CPA3 

NFIA 

HEI10 

FPGS 

FLJ23436 

SLC11A3 

DKFZP566K1924 

DNTT 

MIC2 

KIAA0582 

CD163 



872 



BMSOOCID: <WO_ 



_03039443A2„I_> 



WO 03/039443 



PCT/EP02/12303 



normal BM vs. T-ALL 

accuracy 

confidence 

gene 

231241_at 

225792_at 

224976_at 

204484_at 

218516_s_at 

244652_at 

210613_s_at 

206488_s_at 

201416_at 

201417_at 

218224_at 

206871_at 

214575_s_at 

221188_s_at 

203104_at 

226190_at 

20201 8_s_at 

200631_s_at 

224970_at 

212531 at 



samples: 9 / 9 

1 

1 

signal-to-noise p 

5.25713497545753 0* 

5.04037264761662 0 

4.534006778733 0 

-3.92079658637056 0 

-3.89821124480393 0.01 

3.8571916430704 0 

3.6441364713354 0 

3.59792987181253 0 

-3.57631738716755 0 

-3.57222567838335 0 

-3.56533208466192 0 

3.49320660549908 0 

3.38517251606938 0 

3.26852421442154 0 

3.11161266045758 0 

3.08925054797494 0 

3.06471974822162 0 

-3.05905666007645 0 

3.05809311507561 0 

3.03461908631694 0 



decision limit 

52.3 



gene symbol 



NFIA 

PIK3C2B 

FLJ20421 

SYNGR1 

CD36 

SOX4 

PNMA1 

ELA2 

AZU1 

CIDEB 

CSF1R 

LTF 
SET 
NFIA 
LCN2 



873 



BNSDOCID: <WO_ 



_03039443A2J_> 



WO 03/039443 



PCT/EP02/12303 



normal BM vs. AML +8 

accuracy 

confidence 

gene 

231241_at 

218916_at 

218151_x_at 

212531_at 

22501 4_at 

210244_at 

227999_at 

220755_s_at 

202945_at 

214109_at 

225792_at 

238066_at 

217179_x_at 

200631_s_at 

234764_x_at 

225547_at 

205849_s_at 

208141_s_at 

233613 x at 



samples: 9/10 

1 

1 

signal-to-noise p 

5.25713497545753 0* 

-4.40926777727725 0 

-3.61342068798238 0 

3.35638912536087 0 

-2.46286744087574 0 

2.4347108722936 0 

-2.41422896946156 0 
-2.40201593179233 " 0 

-2.37212917164108 0 

2.28939588941522 0 

2.20832306168575 0 

2.17647785634866 0 

2.173252865121 0 

-2.1126574466818 0 

2.11173306630787 0 

-2.10764119746987 0 

-2.04438654979845 0 

-2.04344739454244 0 

2.02519262390017 0 



decision limit 

52.3 



gene symbol 

FLJ23436 
FLJ11856 
LCN2 

CAMP 
LOG 170394 

FPGS 
LRBA 

RBP7 

SET 



UQCRB 
MGC4293 



BNSDOCID: <WO 03039443A2J_> 



874 



WO 03/039443 PCT/EP02/1 2303 



normal BM vs. AML complex 

accuracy 
confidence 
failed: 
gene 

231241_at 

210244_at 

226734_at 

218516_s_at 

244652_at 

201595_s_at 

212531_at 

201396_s_at 

203092_at 

230795_at 

228377_at 

208141_s_at 

202265_at 

20201 8_s_at 

218916_at 

224406_s_at 

209806_at 

208645_s_at 

200040_at - HG-U133A 

AFFX-HSAC07/X00351JVLat - 

HG-U133A 



samples: 9/36 
1 

15 

signal-to-noise p 

2.20113810647196 0* 

2.15797898100991 0 

2.10647394623844 0 

-2.0003648205556 0 

1.9350839662771 0 

-1.81799799553143 0 

1.80637620431712 0 

-1.79606593499252 0 

-1.77664454556306 0 

-1.75873906279717 0 

1.70396224015785 0 

-1.66429492261381 0 

-1.62376492554641 0 

1.60476985758804 0 

-1.58708293547372 0 

1.57633156642315 0 

-1.57424329592102 0 

1.5695109709269 0 

-1.548128254162 0 

1.11476993738169 0 



decision limit gene symbol 

94.5 

CAMP 

FLJ20421 

HT010 

LCN2 

SGT 

TIMM44 

H4F2 

KIAA1384 

MGC4293 

BMI1 

LTF 

FLJ23436 

IRTA2 

H2B/S 

KHDRBS1 

ACTB 



BNSDOCID: <WO 03039443A2_I_> 



875 



WO 03/039443 



PCT/EP02/12303 



normal BM vs. AML normal 


samples: 9/62 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


212531_at 


3.03296310695958 


0* 


8390.45 


LCN2 


20201 8_s_at 


2.55569861815605 


0 




LTF 


210244_at 


2.12911144557531 


0 




CAMP 


231241_at 


1.90756814920388 


0 






234764_x_at 


1.88445719282136 


0 






238066_jat 


1 .84490304262637 


0 




RBP7 


218516_s_at 


-1.76767038599239 


0 




FLJ20421 


214651_s_at 


-1.75409579509524 


0 




HOXA9 


207802_at 


1.69982859144728 


0 




SGP28 


203535_at 


1.6861966835666 


o ■ 




S100A9 


224976_at 


1.68580070345942 


0 




NFIA 


23581 8_at 


1.65829753113564 


0 






226188_at 


1.63500287325021 


0 






208651 _x_at 


1.6330531805719 


0 




CD24 


214523_at 


1.61056786968333 


0 




CEBPE 


2Q6871_at 


1.60161598002874 


0 




ELA2 


224970_at 


1.57809657249869 


0 




NFIA 


210004_at 


1.57705611671231 


0 




OLR1 


208141_s_at 


-1.55515435811455 


0 




MGC4293 


215379 x_at 


1.53008885935316 


0 




IGLJ3 



BMSDOCID: <WO 03039443A2_I_> 



876 



WO 03/039443 PCT/EP02/12303 



normal BM vs. AML t(8;21) 

accuracy 

confidence 

gene 

225792_at 

224976_at 

201425_at 

201506_at 

208146_s_at 

224970_at 

206488_s_at 

22681 8_at 

224975_at 

227388_at 

217963_s_at 

23361 3_x_at 

228766_at 

208908_s_at 

212586_at 

221802_s_at 

224823_at 

213624_at 

221731_x_at 

225426 at 



samples: 9/13 
1 

1 , 

signal-to-noise p 

5.04037264761662 0* 

4.534006778733 0 

3.96306688659586 0 

3.72288560994884 0 

3.28524851571637 0 

3.05809311507561 0 

2.8751017477522 0 

2.67802773991406 0 

2.55138740597295 0 

2.49815087819486 0 

2.48471435523086 0 

2.45189729191801 0 

2.45082385023982 0 

2.41879730006023 0 

2.33228373164484 0 

2.32414568373485 0 

2.30113027005505 0 

2.29634946162627 0 

2.26796857130761 0 

2.26611296861875 0 



decision limit 

76.45 



gene symbol 

NFIA 

ALDH2 

TGFBI 

CPVL 

NFIA 

CD36 

NFIA 

HCS 



CAST 

ARTS-1 

KIAA1598 

ASM3A 
CSPG2 



BNSOOCID: <WO 03039443A2_I_> 



877 



WO 03/039443 



PCT/EP02/12303 



A'LL t(8;14) vs. ALL Ph 

accuracy 

confidence 

gene 

204663_at 

203373_at 

207971_s_at 

221834_at 

212535_at 

210487_at 

226607_at 

214505_s_at 

226545_at 

201540_at 

209253_at 

228496_s_at 

213854_at 

22821 1_at 

212012_at 

20821 7_at 

20251 9_at 

218836_at 

211031_s_at 

50277 at 



samples: 4/15 

1 

1 

signal-to-noise p 

-3.923285216392 0* 

-3.79361888147031 0 

-3.75632756767036 0 

-3.73210732395713 0 

-3.35180803492209 0 

-2.66400596339357 0 

-2.65303285070543 0 

-2.30002142127038 0 

-2.26350222846019 0 

-2.21889045310099 0 

-2.17983750831107 0 

-2.17784383479245 0 

-2.14963630962421 0 

-2.14770747938169 0 

-2.10857376713896 0 

-2.10769716540219 0 

-2.0579870997007 0 

2.05747195088222 0 

-2.05686503189171 0 

1.4804958933411 0 



decision limit 

80.55 



gene symbol 
ME3 
STATI2 
K1AA0582 



DNTT 

L3MBTL 

FHL1 

FHL1 
SCAM-1 
CRIM1 
SYNGR1 

D2S448 

GABRR2 

MONDOA 

FLJ22638 

CYLN2 

GGA1 



880 



BNSDOCID: <WO_ 



_ 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



ALL t(8;14) vs. T-ALL 

accuracy 

confidence 

gene 

213772_s_at 

23601 9_at 

225277_at 

40148_at 

22821 1_at 

201334_s_at 

2014l7_at 

206241_at 

20891 8_s_at 

210038_at 

225735_at 

209253_at 

202262_x_at 

225129_at 

221969_at 

225080_at 

218338_at 

208664_s_at 

201029_s_at 

56256 at 



samples: 4/9 

1 

1 

signal-to-noise p 

4.99941703918842 0* 

-4.8835710129593 0 

-4.63399536600695 0 

3.56542456382539 0 

-3.35861431980337 0 

3.32612092220108 0 

-2.74359054965603 0 

-2.66958809534806 0 

2.59071326340578 0 

-2.55792113825771 0 

-2.54773299400117 0 

-2.33749346955264 0 

-2.29867528817227 0 

2.29232308514395 0 

2:278395672233 0 

2.24115131588386 0 

-2.22116851483018 0 

-2.20362132175544 0 

-2. 1 461 89081 001 53 0.01 

1.43796973813133 0 



decision limit 

105.35 



gene symbol 

GGA2 



APBB2 

ARHGEF12 

KPNA5 
FLJ 13052 



SCAM-1 

DDAH2 

MDS026 

PAX5 

MYOIC 

EDR1 

TTC3 

MIC2 

LOC51092 



881 



BNSDOCID: <WO_ 



_03039443A2_I_> 



WO 03/039443 



PCT/EP02/ 12303 



ALL t(8;14) vs. AML +8 

accuracy 

confidence 

gene 

213159_at 

226607_at 

205420_at 

202137_s_at 

36004_at 

203731_s_at 

212349_at 

209253_at 

213238_at 

223382_s_at 

219291_at 

212293_at 

1729_at 

225959_s_at 

205690_s_at 

243579_at 

220564_at 

210896_s_at 

221617 at 



samples: 4/ 10 

1 

1 

signal-to-noise p 

-4.39324002240111 0* 

-4.38967336864894 0 

-3.4525752660757 0 

-3.07008892450809 0 

-2.57531929513625 0.01 

-2.45523698701432 0 

-2.36608074328153 0 

-2.32838757070082 0.01 

-2.28171034767303 0 

-2.21844659554677 0 

-2.18210292149327 0 

-2.11981778828952 0 

-1.99780531744703 0 

-1.99555471271853 0.01 

1 .97864646284407 0 

1.96429396023857 0 

1.95122068807877 0 

-1.94074678448131 0 

-1 .92262620134603 0.02 



decision limit 

42.05 



gene symbol 

KIAA0805 

L3MBTL 

PEX7 

BS69 

IKBKG 

ZFP95 

POFUT1 

SCAM-1 

ATP10D 

NIN283 

MDS009 

KIAA0630 

TRADD 

NIN283 

G10 

MSI2 

FLJ11218 

ASPH 



882 



BNSDOCID: <WO_ 



_03039443A2J_> 



WO 03/039443 



PCT/EP02/12303 



ALL t(8;14) vs. AML complex 


samples: 4/36 






accuracy 


0.975 






confidence 


0.967090153309866 






failed: 


4 






gene 


signal-to-noise 


P 


decision limit gene symbol 


36004_at 


-1.99819054416994 


0 


IKBKG 


22841 0_at 


-1.91203372111762 


0 


GAB3 


226291_at 


-1.85352930781754 


0 


ALS2 


239478_x_at 


-1.83892006057297 


0 




201278_at 


-1.78640641972044 


0 


DAB2 


213073_at 


-1.78527733316412 


0 


KIAA0321 


36920_at 


-1.77637196930535 


0 


MTM1 


237864_at 


-1.75181406771053 


0 




206550_s_at 


-1.74071643864686 


0.01 


NUP155 


213779_at 


-1.71354352282537 


0 




237006_at 


-1.71319042514024 


0 




206847_s_at 


-1.68023930751716 


0 


HOW 


213639_sjat 


-1 .6794904444526 


0 


KIAA0557 


40148_at 


1.6627909781575 


0* 


64 APBB2 


210358_x_at 


-1.62491966279342 


0 


MGC2306 


205420_at 


-1.62469403094415 


0 


PEX7 


226499_at 


-1.60847740357238 


0 




212293_at 


-1.56783564141892 


0* 


524.25 KIAA0630 


22821 1_at 


-1.5662932955074 


0 




AFFX-r2-Hs28SrRNA-5_at 








HG-U133B 


0.804899059460234 


0 





BNSDOCID: <WO 03039 443A2_I_> 



883 



WO 03/039443 



PCT/EP02/12303 



ALL t(8;14) vs. AML normal 


samples: 4 / 62 




accuracy 


1 




confidence 


0.94492014468957 




gene 


signal-to-noise 


P 


242774_at 


2.1771591988175 


0 


226066_at 


-2.1383489258559 


0 


239478_x_at 


-1.86376314754514 


0 


40148_at 


1.78834322080621 


0* 


214651_s_at 


-1.72987407199338 


0 


241421_at 


-1.69137753712694 


0 


225277_at 


-1.68295316743627 


0 


237864_at 


-1.66733222775185 


0 


231181_at 


1.63598703935064 


0 


227173_s_at 


1.58512628703822 


0 


22841 0_at 


-1.58118628637501 


0 


226607_at 


-1.56767039969117 


0 


206398_s_at 


1.56041095418581 


0 


226590_at 


-1.55939441513481 


0.01 


223391_at 


1.54577336325056 


0 


205420_at 


-1.53228798256568 


0 


240106_at 


-1.53045374201368 


0 


23601 9_at 


-1.50700230486984 


0 


214558_at 


-1.50544788292166 


0 


AF FX-H U M RGE/M 1 0098_3_at 






-HG-U133B 


0.698344697289208 


0 



decision limit 



53.3 



gene symbol 
SYNE-2 



APBB2 
HOXA9 



BACH2 
GAB3 
L3MBTL 
CD19 

LOC81537 
PEX7 



GPR12 



884 



BNSDOC1D: <WO 



„03039443A2_L> 



WO 03/039443 



PCT/EP02/12303 



ALL t(8;14) vs. AML t(8;21) 

accuracy 

confidence 

gene 

40148_at 

239835_at 

225277_at 

225306_s_at 

210664_s_at 

218319_at 

212985_at 

238155_at 

221617_at 

220307_at 

225250_at 

217080_s_at 

228827_at 

214558_at 

23065CLat 

226546_at 

219478_at 

211709_s_at 

219789_at 

231181 at 



samples: 4/ 13 

i 

1 

signal-to-noise p 

3.56542456382539 0* 

3.10722969539603 0 

-2.97531197185908 0 

-2.20761036808439 0 

-2.19905382124823 0 

2.13133938217549 0 

2.10403815234864 0 

2.09456306072679 0 

-2.04115820286726 0 

-2.04090815318901 0 

2.04032419238326 0 

-2.03866301670049 0 

-2.03460798747208 0 

-2 . 02268322634707 0.01 

-2.02205697080977 0 

-2.00802538838226 0 

-1.99876002437156 0 

-1 .99340446769437 0 

-1.96292007200438 0 

1.9495774287402 0 



decision limit 

32.75 



gene symbol 

APBB2 

KIAA1842 



TFPI 
PELI1 



CD244 
STIM2 
HOMER-2B 

GPR12 



WFDC1 

SCGF 

NPR3 



885 



BNSDCCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



ALL B not Ph vs. all other 

accuracy 
confidence 
failed: 
gene 

230441_at 

202382_s_at 

229253_at 

201482_at 

214116_at 

224435_at 

229344_x_at 

229487_at 

217559_at 

212592_at 

209197_at 

223469_at 

224739_at 

218351_at 

220744_s_jat 

213582_at 

219615_s_at 

203795_s_at 

22981 7_at 

244876 at 



samples: 9 / 260 
0.988847583643123 
0.749589637027985 
1,2,8 

signal-to-noise p 

1.31003692018591 0 

-1.16916758328947 0 

-1.06515235605176 0 

-1.03614111933273 0 

-1.00681323467099 0 

0.992700829908313 0 

0.972107787450054 0 

0.965321723620396 0* 

0.963455203763842 0 

0.956367645094086 0* 

0.921039694411825- 0* 

-0.914920276631408 0* 

0.90442974974001 7 0* 

-0.898398261 455524 * 0* 

-0.89371 9737391 747 0* 

-0.888860830483358 0* 

-0.8878326695192 0* 

0.880543704938786 0 

0.879488726239199 0 

0.877426806604064 0 



decision limit gene symbol 

GNPI 
CTMP 
QSCN6 
BTD 

MGC4248 
KIAA1238 



4436.6 
637.1 



45.5 



RPL10L 

K1AA0080 

MGC10812 

MG61 

FLJ20502 

WDR10 

ATP11A 

KCNK5 

BCL7A 

DKFZP434M098 



BNSDOCID: <WO 03039443A2_L> 



886 



WO 03/039443 



PCT/EP02/12303 



ALL B not Ph vs. ALL Ph 


samples: 9/15 








accuracy 


0.916666666666667 








confidence 


1 








failed: 


3,10 ' 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


219358_s_at 


-1.52735607926994 


0 




CENTA2 


241383_at 


-1.07499870203752 


0.01 






213895_at 


-1.05505220750298 


0 




EMP1 


202123_s_at 


-1.02481061931947 


0* 


753.7 


ABL1 


20591 1_at 


-1.0236634987836 


0 




PTHR1 


242223_at 


1.02343172223498 


0.01 






211709_s_at 


-1.01491744255679 


0 




SCGF 


234839_at 


-1.00185285072786 


0 






212150_at 


-0.996731200580515 


0 




KIAA0143 


221991_at 


-0.987741661696868 


0 




NXPH3 


218543_s_at 


0.973909033712243 


0 




FLJ22693 


201874_at 


0.966766364385792 


0 




FLJ21047 


212188_at 


-0.947998533949464 


0 




LOC1 15207 


241810_at 


-0.944344027780102 


0.01 






207520_at 


-0.936622132674122 


0 






213979_s_jat 


-0.935519171149618 


0 




CTBP1 


229745_x_at 


-0.93424475879621 


0.01 






216680_s_at 


-0.934196756237652 


0 




EPHB4 


202572_s_at 


0.924395823520243 


0 




KIAA0964 


243228_at 


0.916138491705783 . 


0 







BNSDOCID: <WO 03039443A2J_> 



887 



WO 03/039443 



PCT/EP02/12303 



ALL B not Ph vs. T-ALL 

accuracy 

confidence 

gene 

242292_at 

36566_at 

23Q636_s_at 

266_s_at 

209771_x_at 

219631_at 

202113_s_at 

216379_x_at 

221969_at 

229487_at 

208650_s_at 

218464_s_at 

213944_x_at 

226496_at 

202206_at 

205504_at 

213539_at 

211101_x_at 

244876_at 

209772 s at 



samples: 9/9 

1 

1 

signal-to-noise p 

-3.53231266693437 0* 

2.04687603382912 0 

1.96129371358941 0 

1.95685220231101 0 

1.95325312023269 0 

-1 .90940390225071 0 

1.86969444770264 0 
1.84446878357139 " 0 

1.82924887172424 0 

1.78170685853126 0 

1.77571318786079 0 

-1.71465707824422 0 

1.69297405468728 0 

1.65125933296341 0 

-1.58502966518677 0 

1.58481348793145 0 

-1.56405807954932 0 

1 .55958748994576 0.01 

1.55727218904507 0 

1.54041193817483 0 



decision limit 

19.4 



gene symbol 

CTNS 

BTEB1 

CD24 

CD24 

FLJ12929 

SNX2 

PAX5 

CD24 
FLJ10700 



ARL7 
BTK 
CD3D 
LILRA2 

CD24 



888 



BNSDOCID: <WO_ 



_03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



ALL B not Ph vs. AML +8 

accuracy 

confidence 

gene 

229253_at 

230441_at 

202382_s_at 

209135_at 

229487_at 

236656_s_at 

206438_x_at 

207403_at 

219615_s_at 

51192_at 

208248_x_at 

218109_s_at 

206295_at 

219013_at 

217979_at 

208674_x_at 

213474_at 

218351_at 

235422_at 

227709 at 



samples: 9/10 

1 

1 



signal-to-noise 


P 


decision limit gene sym 


-2.32958521030012 


0 


CTMP 


2.03717805375485 


0 




-2.02017770346207 


0 


GNPI 


-1.93398721370049 


0 


ASPH 


1.78170685853126 


0 




1.74257382856431 


0 




-1.68325155133202 


0 


FLJ 12975 


-1.66596170819766 


0 


IRS4 


-1.66280630807372 


0 


KCNK5 


-1.66236513015575 


0 


SSH-3 


-1.64870519570629 


0* 


1987.65 APLP2 


-1.60915618696108 


0 


FLJ14153 


-1 .5980305395866 


0 


IL18 


-1.59551449458991 


0 


FLJ21634 


1.59385936644152 


0 


NET-6 - 


-1.58849840915584 


0 


DDOST 


1.58394324440508 


0 




-1.58117397105311 


0 


FLJ20502 


1.57754106652849 


0 


FALZ 


1.573035497361 


0 


RCN1 



BNSDOCID: <WO 03039443A2_I_> 



889 



WO 03/039443 PCT/EP02/1 2303 



ALL B not 
complex 

accuracy 

confidence 

gene 

230441 _at 

218168_s_at 

219615_s_at 

200620_at 

229487_at 

218718_at 

203372_s_at 

205997_at 

221969_at 

209628_at 

243362_s_at 

219362_at 

244876_at 

227038_at 

215001_s_at 

202382_s_at 

211574_s_at 

203373_at 

206847_s_at 

90610 at 



Ph vs. ANIL 



samples: 9/36 
1 

0.975685516889171 

signal-to-noise p 

2.03717805375485 0 

-1.58072460849855 0* 

-1.55121221944569 0 

-1.52004960803633 0 

1.50364388130017 0 

-1.4701929847603 0 

1.43481740588652 0 

-1.40495787127453 0 

1.3801763712428 0 

-1.36215023734312 0 

1.346548683162 0 

-1.34586251735194 0 

1.32930450347737 0 

-1.3285550634404 0 

-1.31302949268385 0 

-1.31028007399306 0 

-1.27970708025177 0 

1.2683089746763 0 

-1.25861755739009 0 

0.53320359867076 0.01 



decision limit 

584.1 



gene symbol 

CABC1 
KCNK5 
C1orf8 

PDGFC 

STAT12 

ADAM28 

PAX5 

P15-2 

LEF1 

FLJ22643 



GLUL 

GNPI 

MCP 

STATI2 

HOW 

LRRN1 



BNSDOCID: <WO 03039443A2_I_> 



890 



WO 03/039443 



PCT/EP02/12303 



ALL B not Ph vs. AML normal 


samples: 9/62 






accuracy 


0.985915492957746 






confidence 


1 






failed: 


51 ' 






gene 


signal-to-noise 


P 


decision limit gene symbol 


230441 _at 


1.6244512124777 


0 




229487_at 


1.5789226463723 


0 




244876_at 


1.55727218904507 


0 




230659__at 


1.54315361034532 


0 


KIAA0212 


214651_s_at 


-1.46222218442109 


0 


H0XA9 


221969_at 


1.45958458157821 


0* 


695.5 PAX5 


22291 5_s_at 


1.43752454061174 


0 


BANK 


204215_at 


1 .3973053569742 


0 


MGC4175 


223469_at 


-1.39620868523075 


0 


MGC10812 


205997_at 


-1.3950951014202 


0 


ADAM28 


229253_at 


-1.35885129348099 


0 


CTMP 


209905_at 


-1.35690502004825 


0 


HOXA9 


236656_s_at 


1.33665521300451 


0 




205382_s_at 


-1.33068263294683 


0 - 


DF 


214761_at 


1.3284704024734 


0 


OAZ 


235753_at 


-1.3151317291794 


0 




218351_at 


-1.29563799307181 


0 


FLJ20502 


243362_s_at 


1.29364059417566 


0 


LEF1 


201482_at 


-1.28902191317183 


0 


QSCN6 


217979_at 


1.28815999568218 


, 0 


NET-6 



BNSDOCID: <WO 03039443A2_I_> 



891 



WO 03/039443 



PCT/EP02/12303 



ALL B not Ph vs. AML t(8;21) 


samples: 9/13 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


202382_s_at 


-2.19898458962526 


0* 


48.7 


GNPI 


230441_at 


2.03717805375485 


0 






228827_at 


-2.03460798747208 


0 






219478_at 


-1.99876002437156 


0 




WFDC1 


218718_at 


-1.92298631566524 


0 




PDGFC 


203795_s_at 


1.88483347573406 


0 




BCL7A 


229487_at 


1.78170685853126 


0 






241383_at 


-1.74636612447809 


0 






236656_s_at 


1.74257382856431 


0 






23581 8_at 


-1.70111545046162 


0 






239278_at 


1.69632987254595 


0 






204647_at 


-1.69400429479148 


0 




HOMER-3 


213944_x_at 


1.69297405468728 


0 






217989_at 


-1.69293289781308 


0 




LOC51170 


205528_s_at 


-1.64467496425566 


0 




CBFA2T1 


38269_at 


1.6387714295779 


0 




PKD2 


201644_at 


1.59860594123581 


0 




TSTA3 


206622_at 


-1.58757343463781 


0 




TRH 


20431 9_s_at 


-1.57313966435531 


0 




RGS10 



892 



BNSDOCID: <WO 03O39443A2J_> 



WO 03/039443 



PCT/EP02/12303 



ALL Ph vs. all other 


samples: 15/254 








accuracy 


0.973977695167286 








confidence 


1 








failed: 


5,6,10,12,13,14,139 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


210487_at 


1.46744760454294 


0 




DNTT 


203373_at 


1.40251061555498 


0 




STATI2 


234107_s_at 


-1.30126819904145 


0 






224772_at 


1.2067398887062 


0 




MGC14961 


201540_at 


1.17259032247018 


0 




FHL1 


202123_s_at 


1.11803958640816 


0 




ABL1 


202052_s_at 


1.1015960943329 


0 




RAI14 


207971_s_at 


1 .09522659077439 


0 




KIAA0582 


218589_at 


1 .08650464406846 


0 




P2Y5 


210299_s_at 


1.08531672501274 


0 




FHL1 


227584_at 


1.05386625392045 


0 






22331 4_at 


1.04772672325566 


0 




MGC11352 


203355_s_at 


1.04604188199321 


0 




KIAA0942 


212975_at 


1.04561133965056 


0 




KIAA0870 


214505_s_at 


1.03851430654898 


0 




FHL1 


222154_s_at 


1.03658126355559 


0 




DKFZP564A2416 


209679_s_at 


1 .00639728893477 


0 




LOC57228 


230659_at 


1 .0054326445936 


0 




KIAA0212 


1007_s_at 


1.00231733413632 


0 




DDR1 


227998_at 


0.995615703854239 


. o* 


3687.45 


MGC17528 



893 



BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



ALL Ph vs. T-ALL 

accuracy 

confidence 

gene 

218224_at 

213854_at 

221969_at 

205101_at 

213539_at 

228988_at 

208894_at 

209604_s_at 

20961 9_at 

209771_x_at 

210982_s_at 

229487_at 

235706_at 

226878_at 

219631_at 

216379_x_at 

232234_at 

210116_at 

224772_at 

213944 x at 



samples: 15/9 

1 

1 

signal-to-noise p 

-2.4418494296856 0 

2.14963630962421 0 

2.06978634038245 0* 

1.84182814954198 0 

-1.83994967875006 0 

-1.79658055608971 0 

1.79489847703276 0 

-1.75648531255811 0 

1.72743796589982 0 

1.69086499548436 0 

1.68104679025052 0 

1.66237077940113 0 

1.65050775066494 0 

1.64629305544324 0 

-1.6447182870532 0 

1.61950446576807 0 

-1.6009007845449 0 

-1.58593866198308 0 

1.58464619249453 0 

1 .58085800202538 0 



decision limit 



152.35 



gene symbol 

PNMA1 

SYNGR1 

PAX5 

MHC2TA 

CD3D 

ZNF6 

HLA-DRA 

GATA3 

CD74 

CD24 

HLA-DRA 

CPM 

FLJ12929 

C20orf24 
SH2D1A 
MGC14961 



BNSDOCID: <WO 03039 443A2J_> 



894 



WO 03/039443 



PCT/EP02/12303 



ALL Ph vs. AML+8 


samples: 15/10 






accuracy 


1 






confidence 


1 






gene 


signal-to-noise 


P 


decision limit gene syn 


210487_at 


2.5237441884906 


0 


DNTT 


203373_at 


1.87201491464778 


0 


STATI2 


202626_s_at 


-1.85118734150368 


0 


LYN 


226545_at 


1.80654821675707 


0 




219229_at 


-1.75947760308609 


0 


SLC21A1 


201828_x_at 


-1.75807186867031 


0 


CXX1 


217979_at 


1.75748092515514 


0 


NET-6 


51192_at 


-1.7460030120736 


0 


SSH-3 


229302_at 


1.73677596348294 


0 




226438_at 


-1.70384127791425 


0 




227367_at 


-1.69193353561271 


0 




229487_at 


1.66237077940113 


0 




234107_s_at 


-1.64136875764363 


0* 


340.6 


204672__s_at 


-1.64125970689002 


0 


ANKRD6 


202625_at 


-1.61126391589076 


0 


LYN 


231887_s_at 


1 .60675392068253 


0 


KIAA1274 


218942_at 


-1.59127524427508 


0 


FLJ22055 


210749_x_at 


1.58213182698513 


0 


DDR1 


2l3150_at 


-1.57881564631984 


0 


HOXA10 


59697_at 


1.03007856466368 


0 





BNSDOCID: <WO 03O39443A2_l_> 



895 



WO 03/039443 



PCT/EP02/12303 



ALL Ph vs. ANIL complex 


samples: 15/36 






accuracy 


0.980392156862745 






confidence 


1 






failed: 


41 






gene 


signal-to-noise 


P 


decision limit gene symbol 


203373_at 


2.31025004347829 


0 


STATI2 


206847_s_at 


-1.68023930751716 


0 


HOXA7 


203372_s_at 


1.63947005879125 


0 


STATI2 


234107_s_at 


-1.61839900546276 


0 




21Q487_at 


1.52754184797369 


0 


DNTT 


214651_s_at 


-1.49910544616071 


0* 


250.95 HOXA9 


20961 9_at 


1.49461107113623 


0 


CD74 


218718_at 


-1.42002659230416 


0 


PDGFC 


225660__at 


1.39520949215218 


0 


SEMA6A 


226878_at 


1.38541949367152 


0 




235753_at 


-1.36744735557489 


0 




22981 7_at 


1.35899260632582 


0 


DKFZP434M098 


229487_at 


1.334779526707 


0 




214761_at 


1.33168680907428 


0 


OAZ 


225782_at 


-1.32289994708003 


0 




209905_at 


-1.31131029251837 


0 


HOXA9 


235521_at 


-1.3094635283322 


0 


HOXA3 


221969_at 


1.30539974000703 


0 


PAX5 


213150_at 


-1.28908316992088 


0 


HOXA10 


243363_at 


1.22268997650812 


0 


LEF1 



BMSDOCID: <WO 03039443A2J_> 



896 



WO 03/039443 



PCT/EP02/ 12303 



ALL Ph vs. AML normal 


samples: 15/62 








accuracy 


1 








confidence 


0.974929287250918 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


214651_s_at 


-1.85202940734635 


0* 


143.55 


HOXA9 


203373_at 


1.62915723435284 


0 




STATI2 


234107_s_at 


-1.59102595714414 


0 






22981 7_at 


1.58719561744707 


0 




DKFZP434M098 


235753_at 


-1.57582123676185 


0 






224772_at 


1.55894528337306 


0 




MGC14961 


210487_at 


1.55488151717647 


0 




DNTT 


212012_at 


1.4856092975255 


0 




D2S448 


223449_at 


1.48450202420019 


0 




SEMA6A 


209905_at 


-1.46906077761321 


0 




HOXA9 


213150_at 


-1.46604197311664 


0 




HOXA10 


212975_at 


1.45775514780451 


0 




KIAA0870 


221969_at 


1.42534915280476 


0 




PAX5 


206847_s_at 


-1.42447895990314 


0 




HOXA7 


214761_at 


1.42199062015385 


0 




OAZ 


229487_at 


1.42117416701658 


0 






217979_at 


1.40224984357617 


0 




NET-6 


225660_at 


1.39520949215218 


0 




SEMA6A 


212013_at 


1.38743332601863 


0* 


595.1 


D2S448 


22331 4_at 


1.38610020402626 


0 




MGC 11352 



BNSDOCID. <WO 03039443A2_I_> 



897 



WO 03/039443 PCT/EP02/1 2303 



ALL Ph vs. AMLt(8;21) 

accuracy 

confidence 

gene 

210487_at 

203373_at 

228827_at 

226545_at 

224928_at 

218718_at 

221581_s_at 

201811_x_at 

201486_at 

207655_s_at 

223467_at 

224764_at 

218237_s_at 

212535_at 

202123_s_at 

50221_at 

202600_s_at 

208146_s_at 

210151_s_at 

229487 at 



samples: 15/13 

1 

1 

signal-to-noise p 

2.25266231447204 0 

2.24099300433703 0 

-2. 03460798747208 0* 

1.91066898371188 0 

1.88688841526862 0 

-1.86717048634787 0 

1.85128137775803 0 

1.82001958474325 0 

1.78302051564335 0 

1.77674162168543 0 

1.77014181133384 0 

1.76474506640734 0 

1.7513955350881 0 

1.73463136017207 0 

1.72757337414589 0 

1.71143815565141 0 

1.69190236878682 0 

1.67349162864443 0 

1.66867162916475 0 

1.66237077940113 0 



decision limit 



162.15 



gene symbol 

DNTT 

STATI2 



PDGFC 

WBSCR5 

SH3BP5 

RCN2 

BLNK 

RASD1 

ARHGAP10 

SLC38A1 

ABL1 

NRIP1 
CPVL 
DYRK3 



BNSDOCID: <WO 03039443A2_I_> 



898 



WO 03/039443 PCT/EP02/1 2303 



T-ALL vs. all other 


samples: 9 / 260 






accuracy 


0.988847583643123 






confidence 


0.974096654425726 






failed: 


2,8,239 






gene 


signal-to-noise 


P 


decision limit gene symbol 


213539_at 


1.47540002688666 


0 


CD3D 


230588_s_at 


1.40173530720654 


0 


MCPR 


209604_s_at 


1.29998211445613 


0* 


1756.45 GATA3 


233589_x_at 


-1.29177430222739 


0* 


1 


201416_at 


1.28847513298777 


0 


SOX4 


228988__at 


1.22715826655642 


0 


ZNF6 


205640__at 


-1.20119307753531 


0 


ALDH3B1 


201417_at 


1.20045529988474 


0 




228174_at 


1.19343398440307 


0 




206804_at 


1.19196138585152 


0 


CD3G 


218913_s_at 


-1.18780274620161 


0 


LOC51291 


221188_s_at 


-1.17857568967097 


0 


CIDEB 


221555_x_at 


-1.15605774789743 


0 


CDC14B 


229280_s_at 


1.14891159995685 


0 




219079_at 


-1.1401640005663 


0 


b5&b5R 


228242_at 


1.13931660332537 


0 




225003_at 


-1.13574793214416 


0 


MBC3205 


205504_at 


-1.1269796752374 


0 


BTK 


209499_x_at 


-1.12273253976539 


0 


TNFSF13 


226342_at 


1.11835127434314 


, o 





BNSDOCID: <WO 03039443A2_I_> 



899 



WO 03/039443 



PCT/EP02/12303 



T-ALL vs. AML +8 

accuracy 

confidence 

gene 

233589_x_at 

211495_x_at 

209499_x_at 

201416_at 

227999_at 

205640_at 

218913_s_at 

242292_at 

227729_at 

206295_at 

218341_at 

217989_at 

201200_at 

202626_s_at 

201417_at 

201985_at 

219013_at 

219329_s_at 

230588_s_at 

39650 s at 



samples: 9/10 

1 

1 

signal-to-noise p 

-5.40566971880279 0* 

-3.24639126977955 0 

-2.48173374690971 0 

2.45175574451932 0 

-2.41422896946156 0 

-2.36735951723515 0 

-2.26146566262815 0 

2.13276982228889 0 

2.11399761533966 0 

-2.10263843038377 0 

-2.09693358964157 0 

-2.01015824384982 0 

-1.9921136873539 0 

-1.97579203488485 0 

1.94726816387174 0 

-1.9229605331952 0 

-1.91889126390119 0 

-1.91383490407803 0 

1.87407116981284 0 

1.68662830524108 0 



decision limit 

94.05 



gene symbol 

TNFSF13 
TNFSF13 
SOX4 

LOC 170394 

ALDH3B1 

LOC51291 



IL18 " 

FLJ11838 

LOC51170 

CREG 

LYN 

KIAA0196 

FLJ21634 

APR-3 

MCPR 

KIAA0435 



900 



BNSDOCID: <WO_ 



_03039443A2_ 



WO 03/039443 



PCT/EP02/ 12303 



T-ALL vs. AML complex 


samples: 9/36 








accuracy 


0.977777777777778 








confidence 


1 








failed: 


8 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


242292_at 


1 .76638958058548 


0 






233589_x_at 


-1.72473600410199 


0 






201200_at 


-1.67373612904555 


0 




CREG 


221188_s_at 


-1.58025708973943 


0 




CIDEB 


214430_at 


-1.57548239699241 


0 




GLA 


203370_s_at 


-1.49153229697191 


0 




ENIGMA 


213101_s_at 


-1.48624142663186 


0 




IL27 


207543_s_at 


-1.47703523852874 


0* 


244.15 


P4HA1 


200996_at 


-1.46058173493331 


0 




ACTR3 


228242_at 


1 .42599890928864 


0 






200625_s_at 


-1.41869234255251 


0 




CAP 


212271_at 


-1.41675681954144 


0 




MAPK1 


213798_s_at 


-1.3995900667655 


0 




CAP 


218341_at 


-1.38917958617976 


0 




FLJ11838 


207809_s_at 


-1.38033524444329 


0 




ATP6IP1 


20141 6_at 


1.36993537921948 


0 




S0X4 


21401 7_s_at 


-1.36371429454275 


0 




DDX34 


210145_at 


-1.35854949316142 


0 




PLA2G4A 


221006_s_at 


-1.35518233243855 


0 




MY014 



BNSDOCID: <WO 03039443A2J_> 



901 



WO 03/039443 PCT/EP02/1 2303 



T-ALL vs. AML normal 

accuracy 

confidence 

gene 

230292_jat 

201200_at 

202479_s_at 

213539_at 

22491 8_x_at 

227999_at 

219812_at 

228007_at 

231736_x_at 

219079_at 

204484_at 

201537_s_at 

210613_s_at 

217989_at 

207996_s_at 

200765_x_at 

205382_s_at 

201463_s_at 

233589_x_at 

39650 s at 



samples: 9/62 
1 

0.957860143913647 

signal-to-noise p 

1.88640079114873 0* 

-1.83242104341312 0 

1.72124560429389 0* 

1.67079068669821 0 

-1.58916954867569 0 

-1.50019492619142 0 

1.47878284715341 0 

1.4773281277912 0 

-1.46304321667415 0 

-1.46Q351 85202908 0 

1.43138015243683 0 

-1.41942605715997 0 

-1.41293569747837 0 

-1.41116959479258 0 

1.40679076628897 0 

-1.40654043227196 0 

-1.40167577220905 0 

-1.39170471415992 0 

-1.39076513944045 0 

1.23989100884988 0 



decision limit 

209.2 

139 



gene symbol 

CREG 

GS3955 

CD3D 

MGST1 

LOC1 70394 

MGC2463 

MGST1 

b5&b'5R 

PIK3C2B 

DUSP3 

SYNGR1 

LOC51170 

C18orf1 

CTNNA1 

DF 

TALDOI 
KIAA0435 
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T-ALL vs. AMLt(8;21) 

accuracy 

confidence 

gene 

242292_at 

203949_jat 

203948_s_at 

217989_at 

223482_at 

203795_s_at 

218224_at 

217080_s_at 

23601 9_at 

219477_s_at 

228988_at 

201486_at 

219478_at 

228266_s_at 

207996_s_at 

222203_s_at 

205640_at 

210613__s_at 

228174_at 

223474 at 



samples: 9/13 

1 

1 

signal-to-noise p 

3.53231266693437 0* 

-3.12642743394317 0 

-2.45792853300794 0 

-2.43298080062421 0 

-2.33409794008859 0 

2.1853673808492 0 
2.09498423445912 ' 0 

-2.03866301670049 0 

2.02213343587896 0 

2.01911441063646 0 

2.01251275923737 0 

1.99987326641982 0 

-1.99876002437156 0 

1.98746817521137 0 

1.96448548584508 0 

1.94383192684329 0 

-1.93350841183487 0 

-1.93015512462768 0 

1.9244800134633 0 

1.91826574900603 0 



decision limit 

19.4 



gene symbol 

MPO 
MPO 

LOC51170 

TMPIT 

BCL7A 

PNMA1 

HOMER-2B 

LOC55901 

ZNF6 

RCN2 

WFDC1 

CGI-142 

C18orf1 

ALDH3B1 
SYNGR1 

C14orf4 
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AML +8 vs. all other 


samples: 10/259 








accuracy 


0.966542750929368 








confidence 


1 








failed: 


1,2,3,5,6,7,8,9,10 








gene 


signal-to-noise 


p 


decision limit 


gene symb 


212250_at 


1.00891676230186 


0 






218549_s_at 


0.942516384676453 


0 




LOC51115 


212449_s_at 


0.88821669704833 


0 




LYPLA1 


205849_s_at 


0.882097788580276 


0 




UQCRB 


222166_at 


-0.873718781745549 


0 






203007_x_at 


0.87144371195471 


0 






242975_s_at 


-0.862022087552992 


0 




GNAS 


208697_s_at 


0.80816796889713 


0 




EIF3S6 


201592_at 


0.794789783421717 


0 




EIF3S3 


218482_at 


0.794345693087361 


0 




DC6 


201985_at 


0.784318906218296 


0 




KIAA0196 


231101_at 


-0.7838812614223 


0 




PPP2R5E 


218642_s_at 


0.776600267973484 


0 




MGC2217 


201652_at 


0.761703817525426 


0 


* 


COPS5 


201433_s_at 


0.761252302206581 


0 




PTDSS1 


217916_js_at 


0.757235902578392 


0 




BM-009 


226165_at 


0.738121051467492 


0* 


1969.6 


E2F5 


203110_at 


0.736663801658822 


0 




PTK2B 


224804_s_at 


-0.734849706050297 


0 




FLJ00005 
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AML +8 vs. AML complex 

accuracy 

confidence 

gene 

222229_x_at 

208697_s_at 

205849_s_at 

212586_at 

208646_at 

230795_at 

202746_at 

226545_at 

222166_at 

210715_s_at 

228652_at 

217979_at 

201548_s_at 

202747_s_at 

201602_s_at 

205674_x_at 

244740_at 

219518_s_at 

227249_at 



samples: 10/36 
1 

0.855027999708228 

signal-to-noise p 

1.23332137897808 0 

1.15066098039301 0 

1.14869307530221 0 

1.11153173777253 0* 

1.07422424644875 0 

-1.06732853628123 0 

-1.03422849717783 0 

-1.00972156245463 0* 

-0.99982898290901 1 0* 

-0.978857795907468 0* 

-0.974511468277957 0 

-0.973647566321 829 0* 

-0.970651188910704 0* 

-0.952100249128964 0 

-0.951073671804128 0 

-0.945871996779656 0 

-0.942099475768156 0 

-0.938970502867646 0 

-0.931085853722028 0 



decision limit gene symbol 



1231.4 



100.15 
1 

573.3 

234.35 
289.85 



EIF3S6 
UQCRB 
ARTS-1 

H4F2 



SPINT2 

NET-6 

PLU-1 

ITM2A 

PPP1R12A 

FXYD2 

FLJ22637 
NUDE1 
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AML +8 vs. AML normal 

accuracy 
confidence 
failed: 
gene 

214394_x_at 

203110_at 

203007_x_at 

235124_at 

212251_at 

242975_s_at 

206781 _at 

225406_at 

218482_at 

237291_at 

235647_at 

234726_s_at 

225889_at 

236837_x_at 

222166_at 

228092_at 

212250_at 

226450_at 

238392_at 

200655 s at 



samples: 10/62 

0.944444444444444 

1 

2,8,9,10 

signal-to-noise p 

1.16987396866816 0 

1.09167544137433 0 

1.00013993270134 0 

-0.983829818030274 0 

0.969996357896378 0* 

-0.968085578079267 0 

-0.939680187764268 0 

-0.924590937077397 0 

0.900623425034112 0 

-0.889981273455645 0.01 

-0.87846991187601 0 

-0.877362222213618 0 

-0.85606834671728 0 

-0.842273497900729 0 

-0.840979666931678 0 

-0.833074953827583 0 

0.830552344097529 0 

0.827753984186341 0 

-0.812637849766076 0 

0.811239676737242 0 



decision limit 



gene symbol 

FLJ20897 

PTK2B 



4073.8 



GNAS 
DNAJC4 
TSG 
DC6 



CALM1 
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AML+8 vs. AMLt(8;21) 


samples: 10/13 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


p 


decision limit 


gene symbol 


228827_at 


-2.03460798747208 


0* 


162.15 




203188_at 


1.98171895993959 


0 




B3GNT6 


209522_s_at 


1.87499941760774 


0 




CRAT 


50221_at 


1.77237042593137 


0 






212586_at 


1.70313290894867 


0 




ARTS-1 


217816_s_at 


1.69811663165145 


0 




pcnp 


219084_at 


1.68534097481951 


0 




NSD1 


241370_at 


1.66453119103938 


0 






221581_s_at 


1.64609943226078 


0 




WBSCR5 


205528_s_at 


-1.64467496425566 


0 




CBFA2T1 


211341_at 


-1.63310072579237 


0 




P0U4F1 


212250_jat 


1.62854737716103 


0 






206940_s_at 


-1.60737424398258 


0 




P0U4F1 


205529_s_at 


-1.591491111654 


0 




CBFA2T1 


213150_at 


1.5788-1564631984 


0 




HOXA10 


214394_x_at 


1.57039311243451 


0 




FLJ20897 


212895_s_at 


1.56878968456679 


0 




ABR 


218341_at 


1.53928870336148 


0 




FLJ11838 


204249_s_at 


1.52597204656408 


0 




LM02 


214651_s_at 


1.52240931149048 


0 




HOXA9 
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AML complex vs. all other 

accuracy 
confidence 
failed: 
gene 

222229_x_at 

200608_s__at 

219793_at 

214700_x_at 

216652_s_at 

209523_at 

203904_x_at 

209188_x_at 

207654_x_at 

22331 8_s_at 

201437_s_at 

227056_at 

209259_s_at 

216304_x_at 

201263_jat 

213452_at 

225344_at 

203133_at 

20381 8_s_at 

202413 s at 



samples: 36 / 233 

0.940520446096654 

0.921791251652985 

1 ,3,4,7,9, 1 1 , 1 3, 1 7, 1 9,22,26,29, 30, 31 ,36, 1 59 



signal-to-noise p 

-0. 893099830348847 0 

0.829215272839729 0* 

0.800006992121134 0* 

0.78695642651388 0 

0.767651877577316 0 

0.752134816617307 0* 

0.746250973896829 0 

0.739903310934254 0 

0.732205254479051 0 

-0.730537441809073 0 

0.717807687112266 0 

-0.715710672900953 0 

0.712514692902903 0 

0.711429386769807 0 

0.705291260136426 0 

0.700628406937312 0 

0.690845684896899 0 

0.686789233495541 0 

0.685434479377027 0 

0.684073197719258 0 



decision limit gene symbol 



1754.85 
82.35 



681.95 



RAD21 
SNX16 



KAI1 
DR1 
DR1 

MGC 10974 
EIF4E 
KIAA0141 
CSPG6 

TARS 
ZNF184 

SEC61B 

SF3A3 

USP1 



908 
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AMI rnmnlpx v<5 AMI normal 


^amoles* 36 / 62 








a/**!"*! i ra r*\/ 


0 9591 83673469388 








rnnfirlonro 


0 827538830649198 

1 










4 1 7 26 95 








gene 




n 


decision limit 


gene symbol 




-1 1 C >9R99R'34C)6895 


o 






997P^fi at 


-1 01959539961157 


o 




KIAA0141 


ZUUU C? O O a L - riO \J I OOD 


-0 981168516451417 


0* 


3052 25 


HINT1 


at 


-fj Q4679^179R0993 


0* 


3970.8 


YR-29 




-f) Q979Q9Q49476145 


0* 


54 


HOXB6 


onncnfl e -at 
£ uuou o_s__at 


n RR 14919^0797578 


n* 


1 JUU.Z 


RAD21 


OQ^^no -at 


fJ R5R75fifl91 591 R7R 

-U.OOOOJuOZ I JZ lOf o 


n 






/.uuuyo s at - rio-u i oom 


n R4Q^R4Rfi^nfiQn^ 






H1NT1 


2uo4 oo__s_ai 


n R4RQ1495R9R1 541 






H9AFX 


2233 io_S_at 


n R4'*905'3n471fi4'}5 


n 
u 




MHP10Q74 


onnnoQ o ot uir^ i hood 
2UUU2o_S__at - Mo-UlooD 


n R^4Q51^47QQQfi4Q 


n 




Lll JOJ 


2U1 io4_s_at 


n Q9^A 10^791 191=^7 
U.O^OD JUO/ £. I I ZOO/ 


n 




PI IM1 


203345_s_at 


0.823373110066695 


0 




M96 


212251_at - 


0.821052485124022 


0 






218645_at 


-0.816440269243142 


0 




ZNF277 


236728_at 


-0.815324685029762 


0 






21491 1_s_at 


0.812703140158658 


0 






236892_s_at 


-0.810032594833065 


0 




H0XB6 


211950_at 


0.809875752517154 


0* 


1367.5 


RBAF600 


207983_s_at 


0.808929428489603 . 


0 




STAG2 
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AML complex vs. AML t(8;21) 


samples: 36/13 








accuracy 


1 








confidence 


0.956366908409342 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


201851_at 


1.64654123030477 


0 




SH3GL1 


228827_at 


-1.63837794328364 


0* 


277.15 




214651_s_at 


1.52618371146822 


0 




HOXA9 


23552 1_at 


1.52417126144756 


0 




HOXA3 


204249_s_at 


1.52365997713542 


0* 


1581.95 


LM02 


203904_x_at 


1.49494399498725 


0 




KAI1 


206940_s_at 


-1.48076766016125 


0 




POU4F1 


211341_at 


-1.47540487159533 


0 




POU4F1 


209259_s_at 


1.45032016623088 


0 




CSPG6 


212058_at 


1.43907164424644 


0 




KIAA0332 


218577_at 


1.42304726090167 


0 




FLJ20331 


217963_s_at 


1.41730181206619 


0 




HCS 


206622_at 


-1.41476408655437 


0 




TRH 


200071_at-HG-U133A 


1.4105258517176 


0 




SPF30 


218933_at 


1.38255573216414 


0 




MGC5347 


205528_s_at 


-1.38150242627303 


0 




CBFA2T1 


218331_s_at 


1.3810903886183 


0 




FLJ20360 


202406_s_at 


1.3795610685034 


0 




TIAL1 


220796_x_at 


1.36377544613334 


0 




FLJ 14251 


218582_at 


1.35888148459997 


0 




FLJ20445 
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AML normal vs. all other samples: 62 / 207 

accuracy 0.89591 07806691 45 

confidence 0.964449931 704009 

5,14,20,27,30,32,35,37,38,39,42,43,45,48,50,52,53,55,56,57,61,213, 



failed: 


21 5,21 7,21 9,227,230,254 




♦ 


gene 


signal-to-noise 


P 


decision limit gene symbol 


236892_s_at 


0.883816204836633 


0* 


60.95 HOXB6 


228904_at 


0.842467327362967 


0 




239791_at 


0.82014114184115 


0 


HOXB6 


205601_s_at 


0.78108191394345 


0 


HOXB5 


205600_x_at 


0.763742762546178 


0 


HOXB5 


205366_s_at 


0.755814747220724 


0* 


322.65 HOXB6 


230743_at 


0.685019971063637 


0 




205453_at 


0.673160741612025 


0 


HOXB2 


214651_s_at 


0.668311761484385 


0 


HOXA9 


200679_x_at 


-0.662124332081048 


0 


HMG1 


225406_at 


0.660717097117062 


0 


TSG 


200093__s_at - HG-U133B 


0.660005840449192 


0 


HINT1 


206295_at 


0.640927517926634 


0 


IL18 


235753_at 


0.637566839566103 


0 




217768_at 


0.632918049830813 


0 


LOC51637 


225553_at 


0.62830766714963 


0 




225175_s_at 


-0.624292365035176 


0 


CTL2 


200093_s_at - HG-U133A 


0.621189010096421 


0 


HINT1 


214911_s_at 


-0.615055858265367 , 


0 




241464_s_at 


0.611746504394311 


0 
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AML normal vs. AML t(8;21) 

accuracy 

confidence 

gene 

228827_at 

214651_s_at 

205528_s_at 

235753_at 

205529_s_at 

211341_at 

206940_s_at 

209905_at 

213150_at 

217816_s_at 

219598_s_at 

205453_at 

223498_at 

214000_s_at 

23581 8_at 

213844_at 

235521_at 

227279_at 

222448_s_at 

213147 at 



samples: 62/13 

1 

1 

signal-to-noise 

-2.03460798747208 

1.87986749903764 

-1.64467496425566 

1.56101146788885 

-1.4994140185132 

-1.4972813333234 

-1.48355335484551 

1.46906077761321 

1.46604197311664 

1.45253445141557 

1.41726065845379 

1.38558116676561 

1.33655832782001 

-1.2765364955951 

-1.27371437711255 

1.25995066545668 

1.2540451261169 

1.2513845109424 

1.24728565933231 

1.24192806323509 



P 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit 

162.15 



gene symbol 

HOXA9 
CBFA2T1 

CBFA2T1 

POU4F1 

POU4F1 

HOXA9 

HOXA10 

pcnp 

HOXB2 

RGS10 

HOXA5 
HOXA3 
MGC 15737 
UMP-CMPK 
HOXA10 



BNSDOCID: <WO 03O39443A2J_> 



912 



WO 03/039443 



PCT/EP02/12303 



AML t(8;21) vs. all other 

accuracy 
confidence 
failed: 
gene 

228827_at 

205528_s_at 

205529_s_at 

206622_at 

206940_s_at 

211341_at 

212828_at 

212480_at 

219478_at 

219598_s_at 

20481 1_s_at 

214000_s_at 

221581_s_at 

224764_at 

34689_at 

227279_,at 

217816_s_at 

207760_s_at 

23801 2_at 

AFFX-r2-Hs28SrRNA-3_at 
HG-U133B 



samples: 13/256 

0. 996282527881041 
1 

190 

signal-to-noise p 

1.85657763737458 0 

1 .5282463620506 0 

1.44069436893782 0* 

1.31883260079515 0 

1.30418339343959 0 

1.2136561137121 0 

-1.0700055152284 0 

-1.05079328033116 0 

1. "04231 267665759 0 
-1.04225165555028 0 
1.03799840690676 0 
1.01734079918989 0 
-0. 964830298373625 0 
-0.954444787821262 0 
-0.942804241091645 0 
-0.942193016486907 0 
-0.934103105967049 0 
-0.925434637488157 0 
-0. 924942905482366 0 

0.833222345890935 0 



decision limit gene sym6ol 



157.8 



CBFA2T1 

CBFA2T1 

TRH 

POU4F1 

POU4F1 

SYNJ2 

KIAA0376 

WFDC1 

CACNA2D2 

RGS10 

WBSCR5 

ARHGAP10 

TREX1 

MGC15737 

pcnp 

NCOR2 
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Table 30: Analysis of 14 leukemia subgroups and normal bone marrow according to the method, described by Westfall & 
Young. The 500 most significant genes are listed. The raw p value as well as the adjusted p value are shown. Furthermore 
it is indicated whether the genes were also identified as discriminative by the method described by Golub et al. (* most 
important gene in pairwise comparisons, + one of the top 20 significant genes) 



genelD 


' Golub 


rawp adjp 




Gene symbol 


201497_x_at 


* 


0.0001 


0.0001 


MYH11 


201998_at 


+ 


0.0001 


0.0001 


SIAT1 


204150_at 


+ 


0.0001 


0.0001 


STAB1 


20451 1_at 




0.0001 


0.0001 


KIAA0793 


2Q5805_s_at 




0.0001 


0.0001 


ROR1 


206255_at 


+ 


0.0001 


0.0001 


BLK 


206337_at 




0.0001 


0.0001 


CCR7 


207802_at 


+ 


0.0001 


0.0001 


SGP28 


20781 9_s_at 




0.0001 


0.0001 


ABCB4 


208091_s_at 


+ 


0.0001 


0.0001 


DKFZP564K0822 


208456_s_at 


+ 


0.0001 


0.0001 


RRAS2 


209061_at 




0.0001 


0.0001 


NCOA3 


209374_s_at 


+ 


0.0001 


0.0001 


IGHM 


210244_at 




0.0001 


0.0001 


CAMP 


212313_at 




0.0001 


0.0001 




212531_at 


* 


0.0001 


0.0001 


LCN2 


212590_at 


+ 


0.0001 


0.0001 




212827_at 


+ 


0.0001 


0.0001 


IGHM 


212953_x_at 


* 


0.0001 


0.0001 


CALR 


214450_at 


* 


0.0001 


0.0001 
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23741 1_at 




0.0001 


0.0013 LOC153516 


23851 6_at 




0.0001 


0.0013 BMPR2 


242866_x_at 




0.0001 


0.0013 


243030_at 




0.0001 


0.0013 


266_s_at 


+ 


0.0001 


0.0013 CD24 


35974_at 


+ 


0.0001 


0.0013 LRMP 


203057_s_at 




0.0001 


0.0014 PRDM2 


205105_at 




0.0001 


0.0014 MAN2A1 


210763_x_at 




0.0001 


0.0014 LY117 


212569_at 




0.0001 


0.0014 KIAA0650 


212886_at 




0.0001 


0.0014 DKF2P434C171 


221331_x_at 




0.0001 


0.0014 CTLA4 


239292_at 




0.0001 


0.0014 


202052_s_at 


+ 


0.0001 


0.0015 RAI14 


207734_at 




0.0001 


0.0015 FLJ20340 


209822_s_at 


• 


0.0001 


0.0015 VLDLR 


210299_s_at 


+ 


0.0001 


0.0015 FHL1 


211771_s_at 




0.0001 


0.0015 POU2F2 


219090_at 




0.0001 


0.0015 SLC24A3 


219667_s_at 




0.0001 


0.0015 BANK 


226258_at 




0.0001 


0.0015 


227584_at 


+ 


0.0001 


0.0015 


201061_s_at 




0.0001 


0.0017 EPB72 


202863_at 




0.0001 


0.0017 SP100 


204069_at 


* 


0.0001 


0.0017 MEIS1 


204118_at 




J 0.0001 


0.0017 CD48 


204163_at 




0.0001 


0.0017 EMILIN 


206245_s_at 




0.0001 


0.0017 NS1-BP 


208651_x_at 


+ 


0.0001 


0.0017 CD24 


209236_at 




0.0001 


0.0017 


211796_s_at 




0.0001 


0.0017 TRB 


212231_at 




0.0001 


0.0017 FBX021 


213600_at 




0.0001 


0.0017 K1AA0545 


218237_s_at 


+ 


0.0001 


0.0017 SLC38A1 


218614_at 




0.0001 


0.0017 FLJ10652 


227606_s_at 




0.0001 


0.0017 KIAA1373 
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230877_at 




0.0001 


0.0017 IGHG3 


i31794_at 




0.0001 


0.0017 CTLA4 


200068_s_at - HG-U133B 




0.0001 


0.0018 CANX 


200953_s_at 


+ 


0.0001 


0.0018 CCND2 


201810_s_at 




0.0001 


0.0018 SH3BP5 


202421_at 




0.0001 


0.0018 IGSF3 


203143_s_at 




0.0001 


0.0018 KIAA0040 


203355_s_at 


+ 


0.0001 


0.0018 KIAA0942 


203796_s_at 


+ 


0.0001 


0.0018 BCL7A 


204670_x_at 


+ 


0.0001 


0.0018 HLA-DRB5 


204891_s_at 




0.0001 


0.0018 LCK 


207168_s_at 


+ 


0.0001 


0.0018 H2AFY 


207269_at 


+ 


0.0001 


0.0018 DEFA4 


208894_at 


+ 


0.0001 


0.0018 HLA-DRA 


209827_s_at 




0.0001 


0.0018 IL16 


211138_s_at 




0.0001 


0.0018 KMO 


21231 1_at 




0.0001 


0.0018 KIAA0746 


214575_s_at 


+ 


0.0001 


0.0018 AZU1 


220059_at 




0.0001 


0.0018 BRDG1 


221234_s_at 




0.0001 


0.0018 BA6H2 


221778_at 




0.0001 


0.0018 K1AA1718 


221865_at 




0.0001 


0.0018 


222477_s_at 




0.0001 


0.0018 TM7SF3 


224609_at 




0.0001 


0.0018 CTL2 


225136_at 




0.0001 


0.0018 


227242_s_at 




0.0001 


0.0018 


228083_at 


+ 


0.0001 


0.0018 


228343_at 




0.0001 


0.0018 POU2F2 


228551_at 




0.0001 


0 r 0018 


229168_at 




0.0001 


0.0018 


230551_at 




0.0001 


0.0018 


231332_at 




0.0001 


0.0018 


242774_at 


+ 


0.0001 


0.0018 SYNE-2 


243154_at 


+ 


0.0001 


0.0018 


243932_at 




0.0001 


0.0018 


38269_at 


+ 


0.0001 


0.0018 PKD2 


200650_s_at 




0.0001 


0.0019 LDHA 


201825_s_at 




0.0001 


0.0019 LOC51097 


205599_at 




0.0001 


0.0019 TRAF1 


212660_at 




0.0001 


0.0019 KIAA0239 


219029_at 




0.0001 


0.0019 FLJ21657 


222520_s_at 




0.0001 


0.0019 ESRRBL1 
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22291 5_s_at + 
224520_s_at 

227900_at + 
236796_at 

236979_at + 

242388_x_at 

242520_s_at 

37831_at 

AFFX-HUMGAPDH/M331 97_M_at 

HG-U133B 

205624_at 

209369_at 

214761_at + 

218531_at 

227568_at 

230834_at 

230986_at 

235023_at 

235982_at 

201362_at 

202080_s_at + 

202606_s_at 

204205_at 

204351_at + 

2G5223_at 

20541 4_s_at 

208302_at 

209365_s_at 

211502_s_at 

211883_x_at 

211889_x_at 

212012_at + 
213370_s_at 

218589_at + 

219221_at 

223321_s_at 

223894_s_at + 
22481 1_at 
22501 9_at 
22591 7_at 

226043_at + 
226252 at 



0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 

0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 



0.0019 BANK 

0.0019 MGC13168 

0.0019 

0.0019 

0.0019 

0.0019 

0.0019 

0.0019 KIAA0545 
0.0019 GAPD 



0.002 
0.002 
0.002 
0.002 
0.002 
- 0.002 
0.002 
0.002 
0.002 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 
0.0021 



CP A3 

ANXA3 

OAZ 

FLJ21749 



FCRH1 

NS1-BP 

KIAA1042 

TLK1 

MDS019 

S100P 

KIAA0645 

KIAA0672 

HB-1 

ECM1 

PFTK1 

CEACAM1 

CEACAM1 

D2S448 

DKFZP434L243 

P2Y5 

FLJ22332 

FGFRL1 

FTS 

CAMK2D 

DKFZp762B226 

AGS3 
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226560_at 
227998_at 
229001_at 
230689_at 
23141 8_at 
239054_at 
243362 s at 



0.0001 
0.0001 
0.0001 
0.0001 
0.0001 
0.0001 

+ 0.0001 



0.0021 

0.0021 MGC17528 

0.0021 

0.0021 

0.0021 MS4A2 
0.0021 
0.0021 LEF1 
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Table 31a: !n total 46 cases of ALL were analyzed. 44 of 46 cases (95.7%) were assigned to the correct ALL 
subtype in all pairwise comparisons. The sensitivity indicated for each subgroup indicates the percentage of 
cases of the specific subgroup identified correctly in all pairwise comparisons. The specificity indicates for 
each subgroup the percentage of correct assignments to this subgroup. 





ALLt(4;11) 


ALLt(8;14) 


ALL B not 
Ph 


ALL Ph 


T-ALL 


total 


sensitivity % 


specificity % 


ALL t(4;11) 












9 


100.00 


100.00 


ALL t(0, IH) 












4 


100.00 


100.00 


ALL B not 
Ph 






j , % I 

r s%z i~y-- s.-./s. 

Wkmmmmmm 


1 




9 


88.89 


88.89 


ALL Ph 






1 


7811 




15 


93.33 

t 


93.33 


T-ALL 










9 


100.00 


100.00 


total 


9 


4 


9 


15 


9 


46 







10 



927 
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Tgble 31b: In total 184 individual assignments of ALL were analyzed. 182 of 184 assignments (98.9%) were 
correct The sensitivity indicated for each subgroup indicates the percentage of correct assignments for 
cases of the specific subgroup in pairwise comparisons. The specificity indicates for each subgroup the 
percentage of correct assignments to this subgroup. 



ALL t(4;1 1) ALL t(8;14) ALL B not ALL Ph T-ALL total sensitivity Specificity 

Ph % % 



ALLt(4;11) 
ALL t(8;14) 
ALL B not Ph 
ALL Ph 
T-ALL 
total 




36 



16 



36 



60 



■■Hi 36 



100.00 



100.00 



97.22 



98.33 



100.00 



100.00 



100.00 



97.22 



98.33 



100.00 



36 



16 



36 



60 



36 184 



928 
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Table 32: Analysis of 5 ALL subtypes according to the method described by Golub et al. 



ALL 



ALL t(4;11) 
ALL t(8;14) 
ALL B not Ph 
ALL Ph 
T-ALL 



n 
9 
4 
9 
15 



ALLt(4;11) vs. all other 


samples: 9 / 37 








Accuracy 


1 








Confidence 


1 








Gene 


signal-to-noise 


P 


decision limit 


gene symbol 


219033_at 


3.20704240055607 


0* 


2553.55 


FLJ21308 


219463_at 


2.92935553285382 


0 




C20orf103 


204069_at 


2.82114201903624 


0 




MEIS1 


237431_at 


2.55145965416011 


0 






205899_at 


2.08001706361176 


0 




CCNA1 


221969_at 


2.05226504657001 


0 




PAX5 


225592_at 


1.98551092155834 


0 




NRM 


24241 4_at 


1.90103345468382 


0 






225563_at 


1.85734510779608 


0 






201105_at 


1.84845429911833 


0 




LGALS1 


243756_at 


1.82938790694615 


0 






213894_at 


1.79258061939732 


0 




KIAA0960 


215925_s_at 


1.74909667158721 


0 






228083_at 


1.74681828228852 


0 






204044_at 


1.72555848313614 


0 




QPRT 


209170_s_at 


1 .69865585409382 


0 




GPM6B 


209168_at 


1.67497314565832 


0 




GPM6B 


210934_at 


1.66143224232084 


0 




BLK 


238750_at 


1.59639718466905 


0 






224681_at 


1.58202928459614 


0 




GNA12 
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ALL t(4;11) vs. ALL t(8;14) 

accuracy 

confidence 

gene 

210045_at 

240106_at 

202853_s_at 

242434_at 

237431 _at 

201540_at 

215855_s_at 

212357_at 

204798_at 

226795_at 

77508_j_at 

46142_at 

225277_at 

210934_at 

215537_x_at 

232201 _at 

214505_s_at 

244261_at 

20861 4_s_at 

210298_x_at 



samples: 9/4 

1 

1 

signal-to-noise p 

6.76734853184964 0* 

6.10571301118426 0 

-5.88804457870992 0 

-5.60754470569171 0 

5.31661820706767 0 

4.94655082712075 0 

4.71299810202736 0 

4.18767818184794 0 

4.12313508850913 0 

4.12248256776444 0 

4.0317985345148 0 

3.95748459279267 0 

3.7552029934786 0 

3.72127171897839 0 

3.53324247477066 0 

3.50091671488931 0 

3.42260996379197 0 

3.40910490910101 0 

3.34565043119022 0 

3.30171221592859 0 



decision limit 

142.15 



gene symbol 
IDH2 

RYK 



FHL1 

KIAA0280 
MYB 

FLJ23282 
FLJ 12681 

BLK 

NKD2 
FHL1 

FLNB 
FHL1 
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ALL t(4;1 1) vs. ALL B not Ph 

accuracy 

confidence 

gene 

237431_at 

219033_at 

219463_at 

204069_at 

201105_at 

200907_s_at 

24241 4_at 

222492_at 

230441_at 

235291 _s_at 

225592_at 

200906_s_at 

201153_s_at 

201152_s_at 

241985_at 

213894_at 

243756_at 

225563_at 

232231_at 

240581 at 



samples: 9/9 

1 

1 



signal-to-noise 


p 


decision limit 


gene symt 


5.31661820706767 


0* 


49 




3.19032095561144 


0 




FLJ21308 

■ IvWA. 1 WWW 


2.69567768562793 


0 




C20orf 1 03 


2.54127866831197 


0 




MEIS1 


2.30596776500018 


0 




LGALS1 


2.19034049161844 


o 






2.08870062415486 


o 






-2. 04226084466602 


0 






-^.Uo/ I / OUOO/ O^fOD 


u 






2.0210425168076 


0 






2.01329642963674 


0 




NRM 


1.9632986862999 


0 






1.93254941630797 


0 




MBNL 


1.93227192981893 


0 


• 


MBNL 


-1.91875000661653 


0 






1.89533012552966 


0 




KIAA0960 


1.82938790694615 


0 






1.82046495626766 


0 






1.8138577510169 


0 






1.80942575017411 


0 
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ALL t(4;11) vs. ALL Ph 


samples: 9/15 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


•204069jat 


3.31118886883646 


0* 


482.55 


MEIS1 


219033_at 


3.24786485857293 


0 




FLJ21308 


219463_at 


2.78353259146178 


0 




C20orf103 


221969_at 


2.4416835946504 


0 




PAX5 


201874_at 


2.38057857279198 


0 




FLJ21047 


209170_s_at 


2.31526351178702 


0 




GPM6B 


233500_x_at 


2.23629715560156 


0 




LLT1 


205899_at 


2.23375311954146 


0 




CCNA1 


24241 4_at 


'2.21567202901383 


0 






205821_at 


2.10144186601662 


0 




D12S2489E 


205055_at 


2.04951650013049 


0 




ITGAE 


209168_at 


2.02899949049115 


0 




GPM6B 


226939_at 


( 1. 995684021 07224 


0 






209354_at 


-1.9938191217443 


0 




TNFRSF14 


200906_s_at 


1.9632986862999 


0 






225563_at 


1.88712595721941 


0 






237431_at 


1.87880985981148 


0 






3421 0_at 


-1.86187594200254 


0 




CDW52 


202853_s_at 


-1.84984316383277 


0 




RYK 


209167_at 


1.84829764568262 


0 




GPM6B 
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ALL t(4;11) vs. T-ALL 


samples: 9/9 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


221969_at 


5.83664657385464 


0* 


1301 


PAX5 


237431_at 


5.31661820706767 


0 






213772_s_at 


4.80540935532116 


0 




GGA2 


219463_at 


3.9253737114322 


0 




C20orf103 


210934_at 


3.72127171897839 


0 




BLK 


242292_at 


-3.53231266693437 


0 






226496_at 


3.49024496949904 


0 






244261 _at 


3.40910490910101 


0 






205640_at 


3.04306419050975 


0 




ALDH381 


219033_at 


3.03434195473282 


0 




FLJ21308 


209168_at 


3.02932118514235 


0 




GPM6B 


205821_at 


2.9956567374574 


0 




D12S2489E 


235706_at 


2.99518505957364 


0 




CPM 


225592_at 


2.98114633774562 


0 




NRM 


22531 4_at 


-2.87525509857994 


0 






209170_s_at 


2.832580891342 


0 




GPM6B 


215925_s_at 


2.79328349720245 


0 






204069_at 


2.67025815490556 


0 




MEIS1 


210192_at 


2.63764031456982 


0 




ATP8A1 


217080_s_at 


2.60699026264913 


0 




HOMER-2B 



933 
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ALL t(8;14) vs. all other 


samples: 4/42 






accuracy 


1 






confidence 


0.920758831240362 






gene 


signal-to-noise 


P 


decision limit gene symbol 


225277_at 


-2.57998811180236 


0 




22821 1_at 


-2.14623380341388 


0* 


20.8 


202249_s_at 


-1,96332308011424 


0* 


1 H326 


221834_at 


-1.9031437121116 


0 




202262_x_at 


-1.89325979398387 


0 


DDAH2 


215537_x_at 


-1.86843810729553 


0 




209253_at 


-1.83062488265555 . 


0 


SCAM-1 


218836_at 


1.75015142111032 


0* 


431 .25 FLJ22638 


201540_at 


-1.7433648326086 


0 


FHL1 


203373_at 


-1.73135591115253 


0 


STATI2 


212357_at 


-1.70867340748188 


0 


KIAA0280 


229061_s_at 


1.637709601518 


0 


SLC25A13 


211540_s_at 


-1.62480264702951 


0 


RB1 


213639_s_at 


-1.6005915843515 


0 


KIAA0557 


207971_s_at 


-1.60022833182646 


0 


KIAA0582 


202328_s_at 


-1.58949139934067 


0 


PKD1 


23601 9_at 


-1.57756633888103 


0 




3661 2_at 


-1.57219062906198 


0 


KIAA0280 


211031_s_at 


-1.56936126526028 


0 


CYLN2 
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ALL t(8;14) vs. ALL B not Ph 

accuracy 

confidence 

gene 

231567_s_at 

225277_at 

23601 9_at 

239835_at 

235287_at 

221834_at 

224221 _s_at 

202249_s_at 

222275_at 

231181_at 

202137_s_at 

212535_at 

244230_at 

215622_x_at 

23381 3_at 

209891_at 

203373_at 

213504_at 

237006_at 

AFFX-r2-Hs1 8SrRNA-M_x_at 
U133B 



samples: 4 / 9 
1 

1, 

signal-to-noise 

-2.72106559758946 
-2.51129165288142 
-2.39137207272003 
2.1931044914088 
-2.11679606735121 
-2.07908926017389 
-2.01560021968657 
-2.00760725399781 
-1.99241461922196 
1 .9495774287402 
-1.92788330147706 
-1.92074458469524 
-1.90852603079528 
1.89164878686732 
-1.8799760260639 
1.87886429835728 
-1.86193326506748 
1.8188709452285 
-1.7948046900018 
HG- 1.2766621379286 



p decision limit 

0* 34.3 

0 

0 

0 

0.01 
0.01 
0.01 
0 

0.01 
0 

0.01 
0.01 
0 

0.01 

0.01 

0 

0 

0 

0.02 
0.01 



gene symbol 
TSP-NY 



KIAA1842 



VAV3 
H326 



BS69 



HSPC226 

AD024 

STATI2 

MOV34-34KD 
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A'LL t(8;14) vs. ALL Ph 


samples: 4/15 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


P 


decision limit 


gene syml 


204663_at 


-3.923285216392 


0* 


80.55 


ME3 


203373_at 


-3.79361888147031 


0 




STATI2 


207971_s_at 


-3.75632756767036 


0 




KIAA0582 


221834_at 


-3.73210732395713 


0 






212535_at 


-3.35180803492209 


0 






210487_at 


-2.66400596339357 


0 




DNTT 


226607_at 


-2.65303285070543 


0 




L3MBTL 


214505_s_at 


-2.30002142127038 


0 




FHL1 


226545__at 


-2.26350222846019 


0 






201540_at 


-2.21889045310099 


0 




FHL1 


209253_at 


-2.17983750831107 


0 




SCAM-1 


228496_s_at 


-2.17784383479245 


0 




CRIM1 


213854_at 


-2.14963630962421 


0 




SYNGR1 


22821 1_at 


-2.14770747938169 


0 






212012_at 


-2.10857376713896 


0 




D2S448 


20821 7_at 


-2.10769716540219 


0 




GABRR2 


20251 9_at 


-2.05798709970069 


0 




MONDOA 


218836_at 


2.05747195088222 


0 




FLJ22638 


211031_s_at 


-2.05686503189171 


0 




CYLN2 


50277_at 


1.4804958933411 


0 




GGA1 
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accuracy 


1 








confidsnpp 


, 1 








gene 




P 


utzoioiun limit 


gene syrnuc 


213772 s at 


4 QQQ41 70^Q1 8849 








236019 at 


-4 flfl^^71019Q^Q^ 


U.VJ I 






225277 at 


-*t . DjJ JJOUUUU 57 O 


n 






40148 at 


^ c S6 x vt94 £ ^fi'3fi9S'3Q 


VJ.VJ I 




ADRDO 


228211 at 




VJ.U I 






901^4 <; at 


o.ozo i zuyzzzu i uo 


n 
u 




ADUPCC"! O 


901417 at 


9 74^^Q0^4QR^Rn^ 


n 






90R941 at 


-z.DQyooouyoo'fouD 


u 




KPNA5 


90ftQ1 ft q at 


z.oyu/ io^,oo*tuo/o 


n 
U 




rLJ 13052 


910fV3fl at 




n 
U 






99 c v7'3 c i at 




U 






209253_at 


-2. 33749346955264 


0 




SCAM-1 


202262_x_at 


-2.29867528817227 


0 




DDAH2 


225129_at 


2.29232308514395 


0.01 


1 


MDS026 


221969_at 


2.278395672233 


0 




PAX5 


225080_at 


2.24115131588386 


0.01 




MY01C 


218338_at 


-2.22116851483018 


0.01 




EDR1 


208664_s_at 


-2.20362132175544 


0.01 




TTC3 


201029_s_at 


-2.14618908100153 


0 




MIC2 


56256_at 


1.43796973813133 


0 




LOC51092 
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ALL B not Ph vs. all other 

accuracy 

confidence 

failed: 

gene 

219358_s_at 

220744_s_at 

203808_at 

243228_at 

218517_at 

226646_at 

224739_at 

219036_at 

218464_s_at 

211953_s_at 

241383_at 

207403_at 

210519_s_at 

214144_at 

209135_at 

218543_s_at 

34210 at . 



samples: 9/37 
0.934782608695652 
0.902657186900126 
5,7,8 

signal-to-noise p 

-1 .0439321 5326702 0.01 * 

-1.00349163214942 0* 

-0.907795727801968 0 

0.879849065993121 0 

-0.867959639957627 0 

0.864565503015451 0* 

0.839978529078003 0 

-0.833911794382181 0 

-0. 83208295466961 2 0.01 

-0.830459962940188 0 

-0.81 9553908934904 0 

-0.81 2859958493534 0 

-0.801 6666742231 77 0 

-0.796475066163317 0 

-0.787608359617365 0 

0.7731 54455437408 0* 

0.555729083194618 0 



decision limit 

1 
1 



490.45 



765.05 



gene symbol 
CENTA2 
WDR10 
AKT2 

FLJ22479 
KLF2 
MG61 
BITE 

FLJ10700 
KPNB3 

IRS4 

POLR2D 
ASPH 
FLJ22693 
CDW52 
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ALL B not Ph vs. ALL Ph 


samples: 9/15 








accuracy 


0.916666666666667 








confidence 


,1 








failed: 


3,18 








gene 


signal-to-noise 


p 


decision limit 


aene svmbc 


219358_s_at 


-1.52735607926994 


0 




CENTA2 


241383_at 


-1.07499870203752 


0.01 






213895_at 


-1.05505220750298 


0 




EMP1 


202123_s_at 


-1.02481061931947 


0* 


753.7 


ABL1 


20591 1_at 


-1.0236634987836 


0.01 




PTHR1 


242223_at 


1.02343172223498 


0.01 






211709_s_at 


-1.01491744255679 


0 




SCGF 


234839_at 


-1.00185285072786 


o 






212150_at 


-0.996731200580515 


0 




KIAA0143 


221991_at 


-0.987741661696868 


0 




NXPH3 


218543_s_at 


0 97390903371224*3 






FLJ22693 


201874_at 


0.966766364385792 


0 




FLJ21047 


212188_at 


-0.947998533949464 


0 




LOC1 15207 


207520_at 


-0.936622132674122 


0 






213979_s_at 


-0.935519171149617 


0 




CTBP1 


229745_x_at 


-0.93424475879621 


0 






216680_s_at 


-0.934196756237652 


0.01 




EPHB4 


202572_s_at 


0.924395823520243 


0 




KIAA0964 


243228_at 


0.916138491705783 


0 
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ALL B not Ph vs. T-ALL 

accuracy 

confidence 

gene 

242292_at 

36566_at 

230636_s_at 

266_s_at 

209771_x_at 

219631_at 

202113_s_at 

216379_x_at 

221969_at 

229487_at 

208650_s_at 

218464_s_at 

213944_x_at 

226496_at 

202206_at 

205504_at 

213539_at 

211101_x_at 

244876_at 

209772 s at 



samples: 9/9 

1 

1 

signal-to-noise p 

-3.53231266693437 0* 

2.04687603382912 0 

1.96129371358941 0 

1.95685220231101 0 

1.95325312023269 0 

-1.90940390225071 0 

1.86969444770264 0 

1.84446878357139 0 

1.82924887172424 0 

1. 781 706858531 26 0 

1.77571318786079 0 

-1.71465707824422 0 

1.69297405468728 0 

1.65125933296341 0 

-1.58502966518677 0 

1.58481348793145 0 

-1.56405807954932 0 

1.55958748994576 0 

1.55727218904507 0 

1.54041193817483 0 



decision limit 

19.4 



gene symbol 

CTNS 

BTEB1 

CD24 

CD24 

FLJ12929 

SNX2 

PAX5 

CD24 
FLJ10700 



ARL7 
BTK 
CD3D 
LILRA2 

CD24 
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ALL Ph vs all other 

1 \ L- 1 — ■ II V O. ull WLI Iwl 


Qflmnlp*?' 15 / 31 








accuracy 


0 934782608695652 








confidpncfi 


0 864155721068077 
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5 1 1 37 
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O - 
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-0 Q22781 38863441 8 
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PI V>\CiA77 


204501 at 


O 896802057215648 
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MOW 


212998 x at 


O 8661 54Q8 180542 
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-O ft5Rfi3047fl8RQ903 


n* 


one QC 
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u 






90RQ40 q at 


u.ouoi o/oz/ 1 *+i / oy 
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0 




NOV 


212365_at 


0.782772195022069 


0 




MY01B 


204030_s_at 


0.779040391109499 


0 




SCHIP1 


207971_s_at 


0.777645713400436 


0 




KIAA0582 


224772_at 


0.777324945979331 


0 




MGC14961 


230659_at 


0.775582345606179 


0 




KIAA0212 


214051_at 


-0.763853883412362 


0 




TMSNB 


231897_at 


0.756283108145232 


0 






206302_s_at 


0.754443263514809 


0 




NUDT4 
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A| I DK \/q T Al 1 
nLL rn VS. 1 -nLL 


samples, io / a 








accuracy 
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1 OH-y £ t^53DOOD 
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1 7Q/PQQ/77r\Q07fi 
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HLA-UKA 


y bU4_s_at 


-1 . / ot>4ooo1 wioool 1 
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/""\ AT A O 
C^ATAO 


^uybiy — at 


i./sdf 4o / ybooyyoz 
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CD74 


00077-1 v at 


i .Dyuob4yyo4o4ob 


U 




CD24 


z i uyoz^s^at 


l .bol U4b / yu^oUo^c 


U 




1 11 a r\n a 

nLA-URA 


229487_at 


1 .662370779401 1 3 


0 






235706_at 


,1.65050775066494 


0 


i 


CPM 


226878_at 


1.64629305544324 


0 






219631_at 


-1.6447182870532 


0 




FLJ12929 


216379_x_at 


1.61950446576807 


0 






232234_at 


-1 .6009007845449 


0 




C20orf24 


210116_at 


-1.58593866198308 


0 




SH2D1A 


224772_at 


1.58464619249453 


0 




MGC14961 


213944_x_at 


1.58085800202538 


0 
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T-ALL vs. all other 
accuracy 
confidence . 
gene 

218224jat 

219631_at 

213539jat 

20977 1_x_at 

216379_x_at 

208894_at 

206804_at 

266_s_at 

232234_at 

210982_s_at 

213772_s_at 

235706_at 

206398_s_at 

242292_at 

228988_at 

221969_at 

202113_s_at 

202746_at 

226496_at 



samples: 9 / 37 

1 

) 

signal-to-noise 

1 .9746486763803 
1.74096689257481 
1.72260640123424 
-1.54682027770245 
-1.50086383881878 
-1.4723749670674 
1.44192284369386 
-1.41281622043848 
1.4114153057648 
-1.40110964085237 
-1.38475219732458 
-1.37948942896001 
-1.37387052076155 
1.3716037079612 
- 1.36889729345744 
-1.35174546838558 
-1.32897344000246 
1.32364323412114 
-1.30740184360227 



p 


/4A««!einn limit 

decision iimn 


npnp svmbol 


0 






U 




FLJ12929 


U 




CD3D 


0 






0 






0 




HI A-DRA 


U 




CD3G 


0 




CD24 


0 




C20orf24 


0 




HLA-DRA 


0 




GGA2 


0 




CPM 


0 




CD19 


0 






0 




ZNF6 


0* 


149.65 


PAX5 


0 




SNX2 


0 






0 
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Table 33: Analysis of 5 ALL subgroups according to the method described by Westfall & Young. The 23 significant genes 
are listed* The raw p value as well as the adjusted p value are shown. Furthermore it is indicated whether the genes were 
also identified as discriminative by the method described by Golub et al. (* most important gene in pairwise comparisons, 
+ one of the top 20 significant genes) 



genelD 


Golub 


rawp adjp 


Gene symbol 


219463_at 


+ 


0.0001 


0.0042 C20orf103 


204069_at 


* 


0.0001 


0.0046 MEIS1 


219033_at 


* 


0.0001 


0.0051 FLJ21308 


221969_at 


* 


0.0001 


0.0071 PAX5 


205899_at 


+ 


0.0001 


0.0081 CCNA1 


237431_at 


* 


0.0001 


0.0088 


242414_at 


+ 


0.0001 


0.0091 


225563_at 




0.0001 


0.0122 


215925_s_at 


+ 


0.0001 


0.0157 


209168_at 


+ 


0.0001 


0.0168 GPM6B 


204044_at 




0.0001 


0.0170 QPRT 


225592_at 


+ 


0.0001 


0.0174 NRM 


228083_at 


+ 


0.0001 


0.0179 


218224_at 




0.0001 


0.0190 PNMAl 


213539_at 


+ 


0.0001 


0.0195 CD3D 


213894_at 


+ 


0.0001 


0.0197 KIAA0960 


201105_at 


+ 


0.0001 


0.0199 LGALS1 


219631_at 




0.0001 


0.0199 FLJ12929 


209170_s_at 


+ 


0.0001 


0.0201 GPM6B 


243756_at 


+ 


0.0001 


0.0214 


209822_s_at 




0.0001 


0.0254 VLDLR 


228988_at 


+ 


0.0001 


0.0454 ZNF6 


235749_at 




0.0002 


0.0477 
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Table 34a: In total 173 cases of ANIL were analyzed. 160 of 174 cases (92.5%) were assigned to the correct 
AML subtype in all pairwise comparisons. The sensitivity indicated for each subgroup indicates the 
percentage of cases of the specific subgroup identified correctly in all pairwise comparisons. The specificity 
indicates for each subgroup the percentage of correct assignments to this subgroup. 





+8 


other 


complex 


normal 


t(8;21) 


t(15;17) 


inv(16) 


MLL 


total 


sensitivity % 


specificity % 


+8 


■ : :l 






4 










10 


60.00 


100.00 




other 




H 


1 


1 










5 


60.00 


100.00 


complex 






IP 


3 










36 


91.67 


94.29 


normal 






1 










2 


62 


95.16 


85.51 


t(8;21) 


















13 


100.00 


100.00 


t(15;17) 


















20 


100.00 


100.00 


inv(16) 


















12 


100.00 


100.00 


MLL 








2 










15 


86.67 


86.67 


total 


6 


3 


35 


69 


13 


20 


12 


15 


173 
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Table 34b: In total 1211 individual assignments of AML were analyzed. 1198 of 1 211 assignments (98.9%) 
were correct. The sensitivity indicated for each subgroup indicates the percentage of correct assignments 
for cases of the specific subgroup in pairwise comparisons. The specificity indicates for each subgroup the 
percentage of correct assignments to this subgroup. 



5 





+8 


other 


complex 


normal 


t(8;21) 


t(15;17) 


inv(16) 


MLL 


total 


sensitivity % 


specificity % 


+8 


!:6S 

KB 






4 










70 


94.29 


100.00 


other 




;•; : -.33 

Sllll 


1 


1 










35 


94.29 


100.00 


complex 








3 










252 


98.81 


99.20 


normal 






1 


; 








2 

i 


434 


99.54 


. 97.74 


t(8;21) 



















91 


-100.00 


100.00 


t(15;17) 












• | 140 






140 


100.00 


100.00 


inv(16) 














64 




84 


100.00 


100.00 


MLL 








2 










105 


98.10 


98.10 


total 


66 


33 


251 


442 


91 


140 


84 


105 


1211 
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Table 35: Analysis of 8 AML subtypes according to the method described by Golub et at. 



AML subtype 

trisomy 8 

other aberrant 

complex 

normal 

t(8;21) 

t(15;17) 

inv(16) 

MLL 



n 

10 
5 
36 
62 
13 
20 
12 
15 



trisomy 8 vs. aii other 


samples: 10/163 






accuracy 


0.959537572254335 






confidence 


0.569726638054273 






failed: 


1,2,9,10,32,44,50 




• 


gene 


signal-to-noise 


P 


decision limit gene symbol 


214394_x_at 


1.01312549146188 


0 


FLJ20897 


222166_at 


-0.966941507782627 


0 




242975_s_at 


-0.84543459151348 


0 


GNAS 


218549_s_at 


0.837748897801293 


0* 


1095.85 LOC51115 


218642_s_at 


0.837443970437429 


0 


MGC2217 


212250_at 


0.837142769828465 


0 




203110_at 


0.812826558097073 


0 


PTK2B 


219518_s_at 


-0.798960392573077 


0.01 


FLJ22637 


237068_at 


-0.76378369742971 


0 




206781__at 


-0.759613817823006 


0 


DNAJC4 


224804_s_at 


-0.756694932705637 


0.01 


FLJ00005 


235647_at 


-0.746385695306755 


0* 


1 


203007_x_at 


0.741371207542559 


0 




212449_s_at 


0.729872362915738 


0 


LYPLA1 


231981_at 


0.722528507843912 


0* 


199.8 


218482_at 


0.71984358195818 


0 


DC6 


224076_s_at 


0.717538242394189 


0 


WHSC1L1 


231101_at 


-0.7149698250145 


0 


PPP2R5E 
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trisomy 8 vs. other aberrant 

accuracy 

confidence 

gene 

229848_at 

239538_at 

23071 3_at 

209490_s_at 

229362_at 

225073_at 

239699_s_at 

242363_at 

201974_s_at 

219027_s_at 

234643_x_at 

236837_x_at 

226783_at 

238039_at 

224044_at 

226324_s_at 

227587_at 

229637_at 

227744_s_at 

635 s at 



samples: 10/5 

1 

1 

signal-to-noise p 

-2.61497492874017 0* 

-2.22698957925109 0 

-2.16807291296049 0 

-2.0861 3392690798 0 

-2.0821284659189 0 

-2.06488846603302 0 

-2.02729155150144 0 

-1.94643377690021 0 

-1.88600099933531 0 

-1.86639067868364 0 

1.78480980429735 0 

-1.7783166983573 0 

-1.75409747751331 0 

-1 .74203897189399 0.01 

-1.72117511206513 0.01 

-1.71772128724622 . 0 

-1.71651011254536 0 

-1 .71 361 022272893 0.01 

-1.70786273052662 0 

1.25062226167699 0 



decision limit gene symbol 
36.7 ZNF10 



PPT2 
HSPC232 



LOC51622 
MY09A 



FLJ 11040 
SLB 

MGC 15906 

HNRPD 
PPP2R5B 
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trisomy 8 vs. complex 


samples: 10/36 








accuracy 


1 








confidence 


0.855027999708228 








gene 


signal-to-noise 


P 


decision limit 


gene symDoi 
• 


222229_x_at 


1.23332137897808 


0 






208697_s_at 


1.15066098039301 


0 




riro PC 


205849_s_at 


1.14869307530221 


0 




UQCRB 


212586_at 


1.11153173777253 


0* 


1231.4 


AKTo-1 


208646_at 


1 .07422424644875 


0 






230795_at 


-1.06732853628123 


0 




H4F2 


202746_at 


-1.03422849717783 


0 






226545_at 


-1.00972156245463 


0* 


100.15 




222166_at 


-0.999828982909011 


0* 


1 




210715_s_at 


-0.978857795907469 


0* 


573.3 


SPINT2 


228652_at 


-0.97451 1468277957 


0 






217979_at 


-0.973647566321829 


0* 


234.35 


NET-6 


201548_s_at 


-0.970651188910704 


0* 


289.85 


PLU-1 


202747_s_at 


-0.952100249128964 


0 




ITM2A 


201602_s_at 


-0.951073671804128 


0 




PPP1R12A 


205674_x_at 


-0.945871996779656 


0 




FXYD2 


244740_at 


-0.942099475768156 


0 






219518_s_at 


-0.938970502867646 


0 




FLJ22637 


227249_at 


-0.931085853722028 


0 




NUDE1 



BNSDOCID: <WO 03039443A2_I_> 



949 



WO 03/039443 



PCT/EP02/12303 



trisomy 8 vs. normal samples: 10/62 



accuracy 


0.944444444444444 








confidence 










failed: 


12 3 5 








gene 


signal-to-noise 


p 


decision limit 


gene symbol 


214394_x_at 


1 16270137582694 


0 




FLJ20897 


235124_at 


-1 .01 432223685267 


0 






203007_x_at 


0.98341 4984588329 


0 






206781 _at 


-0.9701 85909793454 


0 




DNAJC4 


24297o_s_at 


-0.968085578079267 


0 




GNAS 


203110_at 


0 964568800558985 


0 




PTK2B 


212251_at 


0 940237396721 025 


0* 


4073.8 




225406_at 


-D Q^9Q4321 4378223 


0 




TSG 


234726_s_at 


-0 90064273536451 9 


0 






218482_at 




o 




DC6 


23729 1_at 


-0.889981 273455646 


0 






23564 /_at 


-0.884166779739501 


0 






236837_x_at 


-0.866101703732076 


0.01 






222166_at 


-0.86434387750654.1 


o' 






225889_at 


-0.864142405574917 


0 






217994_x_at 


-0.85609014416877 


0 






203602_s_at 


-0.841071963439903 


0 




ZNF151 


228092_at 


-0.837138335753897 


0 






224804_s_at 


-0.83110664616695 


0 




FLJ00005 


212250_at 


0.827294640416809 


0 







950 

BNSOOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



trisomy 8 vs. t(8;21) 


samples: 10/13 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


p 


decision limit 


aene svrr 


228827_at 


-2.03460798747208 


0* 


162.15 




203188_at 


1.98171895993959 


0 




B3GNT6 


209522_s_at 


1.87499941760774 


0 




CRAT 


50221_at 


1.77237042593137 


0 






212586_at 


1.70313290894867 


0 




ARTS-1 


217816_s_at 


1.69811663165145 


0 




ocnD 


219084_at 


1.68534097481951 


0 




NSD1 


241370_at 


1.66453119103938 


0 






221581_s_at 


1.64609943226078 


0 




WBSCR5 


205528_s_at 


-1.64467496425566 


0 




CBFA2T1 


2l1341_at 


-1.63310072579237 


0 




POU4F1 


212250_at 


1.62854737716103 


0 






206940_s_at 


-1.60737424398258 


0 




POU4F1 


205529_s_at 


-1.591491111654 


0 




CBFA2T1 


213150_at 


1.57881564631984 


0 




HOXA10 


214394_x_at 


1.57039311243451 


0 




FLJ20897 


212895_s_at 


1.56878968456679 


0 




ABR 


218341_at 


1.53928870336148 


0 




FLJ 11838 


204249_s_at 


1.52597204656408 


0 




LM02 


214651_s_at 


1.52240931149048 


0 




H0XA9 



951 

BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/ 12303 



trisomy 8 vs. t(15;17) 


samples: 10/20 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


P 


decision limit 


gene symb 


214450_at 


.3.0865086945171 


0* 


1926.7 


CTSW 


212953_x_at 


-2.71319553382926 


0 




CALR 


236787_at 


-2.4273500798775 


0 






200952_s_at 


-2.39696270141848 


0 




CCND2 


205624_at 


-2.38480346275743 


0 




CP A3 


38487_at 


-2.38251622694176 


0 




FLJ12442 


205614_x_at 


-2.24846130719191 


0 




MST1 


20676t_at 


-2.20501899975366 


0 




TACTILE 


216320_x_at 


-2.16690689445934 


0 






224794_s_at 


. -2.14479331910386 


0 




LOC51148 


233072_at 


-2.07721746169547 


0 




KIAA1857 


221004_s_at 


-2.05892662897696 


0 




ITM3 


227326_at 


-2.04924504239987 


0 






221980_at 


-2.01752809990056 


0 






225547_at 


* 1.98126007880124 


0 ' 






210145_at 


1.95455412087558 


0 




PLA2G4A 


212509__s_at 


-1.92046303571342 


0 






209344_at 


-1.90906303754685 


0 




TPM4 


201029 s at 


-1.87538506046111 


0 




MIC2 



952 

BNSDOCID: <WO 03039 443A2J _> 



WO 03/039443 



PCT/EP02/12303 



trisomy 8 vs. inv(16) 


samoles* 10/12 








accuracy 












1 








gens 






decision limit 


npnp Q\/nihoi 
yciic o y 1 1 ikJKj i 


233138 at 


-3 47351 238271 1 4 


0* 


54.25 


• 


209365 s at 


-3 OOQRfififi l S47018Q 


o 




FPM1 




-2 81 807078Q81 R5 






9FRPINP1 


218Q49 at 




n 




P! 1990*^ 


901898 y at 


2 91 18^974981 QQR 


n 






900Q^1 «? at 




n 








9 01 nft94fi^9R1 149 


n 
u 




LUU I ZOU I o 


Z\Jo I OO ol 




u 






9090***% at 


1 Q147QfY3fiQ791 P 


u 




X ID9 




-i.yuo/ /uoooooo/ o 


u 




LSI A AnCOC 




i .suoooy/ ou i oooo 


n 
u 




UBtZto 


204oo1_at 


-1 .83776082o26379 


0 




CDW52 


241525_at 


-1.83006374766269 


0 






20871 0_s_at 


1.80896001968559 


0 




AP3D1 


3421 0_at 


-1.75572696362264 


0 




CDW52 


201497_x_at 


-1.74852845185764 


0 




MYH11 


212236_x_at 


-1.74679406476776 


0 






213810_s_at 


-1.69077455623534 


0 




FLJ10342 


212250_at 


1.62885302351785 


0 






202370_s_at 


1.61951331078439 


0 




CBFB 



953 

BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



trisomy 8 vs. MLL 
accuracy 
confidence 
gene 

200056_s_at - HG-U133A 

212250_at 

228024_at 

205355_at 

225700_at 

234726_s_at 

20261 9_s_at 

205453_at 

202823_at 

210749_x_at 

227786_at 

212479_s_at 

219312_s_at 

218172_s_at 

200867_at 

202956_at 

213902_at 

239597_at 

214789_x_at 

203053 at 



samples: 10/15 
1 

0.8689072208975 

signal-to-noise p 

1.60303643777462 0 

1.47262825515036 0 

1.43968630786794 0 

1.39534301157978 0 

1.39419450937928 0 

-1.36257922965513 0 

1.34323877280623 0 

1.29520111839967 0 

1.28255326311509 0 

1.26893758929818 0 

1.24020773870069 0 

1.23922427261637 0* 

1.21983738431425 0 

1.21403337241271 0* 

1.2110165320696 0*' 

1.21043821718201 0* 

1.19710457307816 0 

-1.19444081572455 0 

1.19367799101574 0 

1.18683041233911 0 



decision limit gene symbol 
C1D 

PAK1 
ACAD SB 



PLOD2 
HOXB2 
TCEB1 
DDR1 
TRAP25 
437.3 FLJ13910 
RINZF 

285.95 PR02577 
531 

550.6 BIG1 
ASAH 

SRP46 
BCAS2 



954 



BNSDOCID: <WO_ 



_03Q39443A2J_> 



WO 03/039443 



PCT/EP02/12303 



other aberrant vs. all other 

accuracy 

confidence 

failed: 

gene 

233328_x_at 

219156_at 

214310_s_at 

229003_x_at 

213725_x_at 

236648_at 

226634_at 

224664_at 

219337_at 

228660_x_at 

217375_at 

219027_s_at 

239538_at 

203796_s_at 

211918_x_at 

241795_at 

222147_s_at 

227206_at 

635 s at 



samples: 5/168 
0.994219653179191 
0.899486235418654 
4 

signal-to-noise p 

-1 .52904798962408 0.01 

1.27140814977495 0 

-1.27139871979195 0 

-1.20449873883071 0 

1.1911680827765 0 

1.15777539407154 0* 

1.15695756502745 0 

1.14829275512581 0 

-1.09306764196133 0 

1.09254562955252 0* 

-1.08474018637782 0 

1.0790791377144 0 

1.07873188809285 0 

1.04936325547633 0 

-1 .04651632950823 0.01 

-1.04488187252 0 

-1 .02048867081 398 0.01 

-1 .01 633702093738 0.01 

-1.01612668262432 0 



decision limit gene symbol 

FLJ11271 
ZFPL1 



82 



FLJ20584 
SEMA4F 

MY09A 

BCL7A 
PLAC3 



PPP2R5B 



955 



BNSOOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/1 2303 



other aberrant vs. complex 

accuracy 

confidence 

failed: 

gene 

225804_at 

211819_s_at 

233328_x_at 

229003_x_at 

22371 2_at 

635_s_at 

231002_s_at 

241734_at 

201530_x_at 

211918_x_at 

222229_x_at 

231945_at 

202150_s_at 

212171_x_at 

214310_s_at 

219337_at 

235263_at 

233195_at 

210817_s_at 



samples: 5/36 
0.975609756097561 
0.990141898103344 
2 

signal-to-noise p 

1.56382089574999 0 

-1.45297162934509 0* 

-1.44097598487297 0* 

-1.40901327014506 0* 

1.36997409793585 0 

-1.34942029903122 0 

1.3258291286036 0 

1.32533459318655 0 

1.3093371019822 0 

-1 .302641 050371 78 0.01 

1.214222183319 0 

-1.2091074222176 0.01 

-1.2052224977219 0 

-1.20353762848401 0 ' 

-1.19546833367549 0 

-1.19331909893746 0 

1.19244840508646 0 

-1.19077762820993 0 

1.17387606136153 0 



decision limit gene symbol 



SH3D5 



DCOHM 

PPP2R5B 

NUP88 

EIF4A1 
PLAC3 

K1AA1275 

HEF1 

VEGF 

ZFPL1 

FLJ20584 

DKFZP434A0131 

NDP52 



BNSDOCID: <WO 03039443A2_1_> 
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other aberrant vs. normal 

accuracy 

confidence 

failed: 

gene 

233328_x_at 

220924_s_at 

213725_x_at 

229003_x_at 

207057_at 

239393_at 

218041_x_at 

219156_at 

227206_at 

217375_at 

219337_at 

214894_x_at 

225452_at 

205316_at 

212469_at 

214310_s_at 

226634_at 

234132_at 

215115_x_at 

45633 at 



samples: 5/62 
0.985074626865672 
0.991986319540221 
4 

signal-to-noise p 

-1.9106472582954 0* 

1.36395677335271 0 

1.33852421412964 0* 

-1.33766416205182 0 

1.32428111403211 0 

1.30248902092406 0 

1.28800254801096 0 

1.27488096632736 0 

-1.24583383572442 0 

-1 .22806408433652 0.01 

-1 .2222324266691 3 0.01 

1.21435713598714 0 

1.19358949672009 0 

1.18925500392873 0 

1.17704907364629 0 

-1.176580899867 0.01 

1.15416325511173 0 

-1 . 1 4588852496026 0.01 

-1.14294381059057 0 

0.924726636126963 0 



decision limit gene symbol 



1096.35 



SLC38A2 

SLC16A7 

PRO1068 
FLJ11271 



FLJ20584 

MACF1 

PPARBP 

SLC15A2 

IDN3 

2FPL1 



NTRK3 
FLJ13912 



957 



BNSDOCID: <WO_ 



_03039443A2_L> 



WO 03/039443 



PCT/EP02/12303 



other aberrant vs. t(8;21) 


samples: 5/13 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


P 


decision limit 


gene symbi 


221770_at 


7.99150022255116 


0* 


55.8 


RPE 


205607 s at 


5.76670522441814 


0 




LOC57147* 


225670 at 


3.48256305910835 


0 






^4fi89 at 


3.47855359314813 


0 




TREX1 




3.25650613162099 


0 




BCL7A 


226634 at 


3.21064553321956 


0 






232127 at 


2.70698092772246 


0.01 






200659 s at 


2.68716126436031 


0 




PHB 


(lUZZOO al 


2.6425819500005 


0 




FRAP1 


9^400*5 y at 


2.64219114616805 


0 




STK36 


212309 at 


2.52957598578167 


0 




CLASP2 


221206 at 


2.4774418396665 


0 




FLJ21459 


222163 s at 


2.44259488401336 


0 




MGC5347 


210128_s_at 


2.40486023166818 


0 




LTB4R 


242448_at 


2.40485463677367 


0 






212895_s_at 


2.38453513890117 


0 




ABR 


213313_at 


2.35617502073221 


0 




GAPCENA 


220796_x_at 


2.33914050862301 


0 




FLJ 14251 


242363_at 


2.29793881298951 


0 






225902_at 


2.28202551140514 


0 







958 

BNSDOCID: <WO 03039 443A2J_> 



WO 03/039443 
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other aberrant vs. t(15;17) 


samples: 5/20 








accuracy 


1 








confidence 

i 


1 








gene 


si anal -to- noise 


H 


Hpoicinn limit 


gene symoi 


213716_s_at 


5 68177086653034 

W >WV Mil www wO WW w i 


0* 




otU 1 Ml 


235483_at 


3 71724867723655 


o 






224664_at 


3 45875495264879 


o 






214310_s_at 


-3 345175974803 


n 




7CDI <\ 


38487_at 


-3 26848789132101 


n 




PI 11 O^d/IO 

r L J 1 ^^^Z 


210128_s_at 


3 21 3874201 1 46? S 


n 




L 1 D4K 


214450 at 




n 




CTSW 


209428 s at 




n 
u 




7rni a 

ZFPL1 


233328_x at 




n 
u 






201034 at 




n 
u 




ADD3 






0 




CALR 


201753_s_at 


2.7070306360752 


0 




ADD3 


208629_s_at 


-2.67288815920275 


0 




HADHA 


205668_at 


2.67287531876386 


i 

0 




LY75 


229937_x_at 


2.57076096076482 


0 






227701_at 


2.49972617912329 


0 






203948_s_at 


-2.49051133555807 


0 




MPO 


219280_at 


2.49000744284001 


0 




WDR9 


227999_at 


2.46891417509171 


0 




LOC1 70394 



BNSDOCID: <WO 03039443A2_I_> 



959 



WO 03/039443 
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other aberrant vs. inv(16) 

accuracy 

confidence 

gene 

227224_at 

212469_at 

229003_x_at 

239538_at 

216399_s_at 

223471_at 

201338_x_at 

231926_at 

202370_s_at 

224664_at 

204858_s_at 

227701 _at 

212686_at 

242525_at 

211824_x_at 

218259_at 

229848_at 

238791 _at 

212287_at 

205055_at 



samples: 5/12 

1 

1 

signal-to-noise P 

4.52730002135185 0* 

4.42589960567721 0 

-3.90005268853897 0 

3.74361173339624 0 

3.5206821287331 0 

3.22276962259045 0 

3.16855282128268 0 

3.15052042879849 0 

3.09376701617797 0 

3.04111626168471 0 

-2.96720397750981 0 

2.92458695035083 0 
2.82379160235424 0 
2.80598603189029 0 
-2.77113099215966 0 
2.6673432505011 0 
2.61497492874017 0 
2.55305211995074 0 
2.52556127587716 0 
2.49589801203404 0 



decision limit gene symbol 

67.25 

1DN3 • 



GTF3A 

CBFB 

ECGF1 

KIAA1157 

DEFCAP 

KIAA1243 

ZNF10 

JJAZ1 
ITGAE 



BNSDOC1D: <WO 03039443A2J_> 



960 
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other aberrant vs. MLL samples: 5/15 

accuracy 1 

confidence . 1 

gene signal-to-noise p 

232125_at 2.83634822109321 0* 

222335_at 2.80079072544706 0 

203593_at 2.5044381 0972067 0 

220924_s_at 2.44940338874045 0 

234584_s_at 2.3647381 861 9324 0 

228328_at 2.2823093126082 0 

203177_x_at 2.20405255989013 0 

218041_x_at 2.19960407203391 0 

222982_x_at 2.1705009156018 0 

240201_at 2.17038421694015 0 

223401_at 2. 1 574569971 375 0 

207057_at 2.10339239461692 0 

218499_at 2.08679728034545 0 

231926_at 2.03322303603834 0 

202926_at 2.01 1 827531 57743 0 

21 6399_s_at 1 .98784990984825 0 

21 2549_at 1 .9826673991 7742 0 

203897_at 1 .9541 2833774499 0 

226741_at 1 .9402502875491 3 0 

243154 at 1.93816368688103 0 



decision limit gene symbol 

10.45 

CD2AP 

SLC38A2 

ATE1 

TFAM 

PRO1068 

SLC38A2 

MDS006 

SLC16A7 

MST4 

NAG 



LOC57149 
LOC51234 



BNSDOCID: <WO 03039443A2_I_> 
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complex vs. all other 


samples: 36/137 








accuracy 


0.936416184971098 








confidence 


0.914058016154132 








failed: 


2,5,9,12,14,19,23,26,30,32,36 








gene 


signal-to-noise 


P 


decision limit 


gene sym 


222229_x_at 


-0.977113333847485 


0 




< 


22331 8_s_at 


-0.871134323886282 


0 




MGC1097. 


200608_s_at 


0.859861630456657 


0* 


1754.85 


RAD21 


209085_x_at 


0.827754504723174 


0* 


543.85 


RFC1 


201377_at 


0.821542753098708 


0 




KIAA0144 


214700_x_at 


0.816898518764863 


0 






227056_at 


-0.808158632738399 


0 




KIAA0141 


201164_s_at 


0.805491372224207 


0 




PUM1 


20241 3_s_at 


0.794856444512317 


0 




USP1 


209523_at 


0.794733968085209 


0 






205382_s_at 


-0.786501229240988 


0 




DF 


213452_at 


0.763286023547173 


0 




ZNF184 


203904_x_at 


0.761563303490256 


0 




KAI1 


212629_s_at 


0.758597736815579 


0 




PRKCL2 


219793_at 


0.753945856377937 


0 




SIMX16 


200093_s_at - HG-U133B 


-0.7505050474584 


0 




HINT1 


225065_x_at 


-0.740661805635393 


0 






21Q053_at 


0.737429178254302 


0 




TAF5 


209259_s_at 


0.734770265079843 


0 




CSPG6 


209023_s_at 


0.732408586508097 


0 




STAG2 
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BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



complex vs. normal 

accuracy 

confidence 

failed: 

gene 

222229_x_at 

227056_at 

201922_at 

200093_s_at - HG-U133B 
239791 _at 
200608_s_at 
235502_at 
22331 8_s_at 

200023_s_at - HG-U133B 

200093_s_at - HG-U133A 

218645_at 

213846_at 

236728_at 

236892_s_at 

241395_at 

211950_at 

203345__s_at 

231277_x_at 

212251_at 



samples: 36 / 62 
0.959183673469388 
0.816646490079598 
9,23,36,88 

signal-to-noise p 

-1.12052810135302 0 

-1.02600466720905 0 

-0.97244940026771 3 0* 

-0.968640220043115 0 

-0.9272999424761 45 0* 

0.882207986996843 0* 

-0.870929654024877 0 

-0.867424298137502 0 

-0.86661263717398 0 

-0.86073091922176 0 

-0.842132977275412 0 

-0,824280554924366 0 

-0.81 9459743485574 0 

-0.810032594833065 0 

-0. 806502754696553 0 

0.803157394705844 0* 

0.79268995247901 0 

-0.79165327189793 0 

0.787798207070077 0 



decision limit gene symbol 

KIAA0141 
3970.8 YR-29 
HINT1 

54 HOXB6 
1566.2 RAD21 

MGC10974 

EIF3S5 

HINT1 

ZNF277 

COX7C 

HOXB6 

1367.5 RBAF600 
M96 
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WO 03/039443 



PCT/EP02/12303 



complex vs. t(8;21) 


samples: 36/13 








accuracy 


1 








confidence 


0.956366908409342 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


201851_at 


1.64654123030477 


0 




SH3GL1 


228827_at 


-1.63837794328364 


0* 


277.15 


• 


214651_s_at 


1.52618371146822 


0 




HOXA9 


235521 _at 


1.52417126144756 


0 




HOXA3 


204249_s_at 


1.52365997713542 


0* 


1581.95 


LM02 


203904_x_at 


1.49494399498725 


0 




KA11 


206940_s_at 


-1.48076766016125 


0 




P0U4F1 


211341_at 


-1.47540487159533 


0 




POU4F1 


209259_s_at 


1.45032016623088 


0 




CSPG6 


212058_at 


1.43907164424644 


0 




KIAA0332 


218577_at 


1.42304726090167 


0 




FLJ20331 


217963_s_at 


1.41730181206619 


0 




HCS 


'206622_at 


-1.41476408655437 


0 




TRH 


200071_at-HG-U133A 


1.4105258517176 . 


0 




SPF30 


218933__at 


1.38255573216414 


0 


i 


MGC5347 


205528_s_at 


-1.38150242627303 


0 




CBFA2T1 


218331_s_at 


1.3810903886183 


0 




FLJ20360 


202406_s_at 


1.3795610685034 


0 




TIAL1 


220796_x_at 


1.36377544613334 


0 




FLJ14251 


218582_at 


1.35888148459997 


0 




FU20445 



BNSDOCIO <WO 03039443A2_I_> 
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WO 03/039443 



PCT/EP02/12303 



complex vs. t(15;17) 


samples: 36 / 20 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


205382_s_at 


-2.47024872277389 


0 




Ur 


212953_x_at 


-2.44599456599903 


0* 


4652.35 


GALK 


64942_at 


-2.283041 27550384 


0 






214450_at 


-2.262737051 81 24 


0 




U 1 oW 


38487_at 


-2. 1 5395063071 356 


0 




FLJ 12442 


224794_s_at 


-2. 1 447933 1 91 0386 


0 




1 Ar»C4 A AO 

LOC51 1 48 


220798_x_at 


-2.06303065394458 


0 




FLJ 1 1535 


216032_s_at 


-2.05259440043708 


0 




SDBCAG84 


203948_s_at 


-2.04245448483567 


0 




MPO 


209732_at 


1.90630142681759 


0 




CLECSF2 


230526_at 


1.90204644112897 


0 




FLJ20015 


238022_at 


-1.89121106100583 


0 






200654_at 


-1, 732094071 32843 


0 




P4HB 


204150_at 


-1.72000809746397 


0 




STAB1 


213447_at 


1.71480861978241 


0 




IPW 


206847_s_at 


1.68023930751716 


0 




HOXA7 


203074_at 


-1.65576107663154 


0 




ANXA8 


219837_s_at 


-1.65191807395586 


0 




C17 


20093 1_s_at 


1.6356222023809 


0 




VCL 


201923 at 


1.60784567114272 


0 




PRDX4 
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complex vs. inv(16) 
accuracy 
confidence 
gene 

203092_at 

209190_s_at 

201497_x_at 

205076_s_at 

241525_at 

213779_at 

210982_s_at 

200985_s_jat 

212463_at 

200675_at 

218942_at 

200984_s_at 

208894_at 

202265_at 

224724_at 

210715_s_at 

213452_at 

205382_s_at 

201360 at 



samples: 36/12 
1 

0.957308305034528 
signal-to-noise 

1.77664454556306 

-1.75723541848141 

-1.74852845185764 

-1.73951655525411 

-1.71682483225979 

1.71354352282537 

-1.70318998731519 

1.64643993864436 

1.6457941052799 

1.61546783522649 

1.59347299102441 

1.56833724351535 

-1.54975491884609 

1.53139324627965 

-1.5143785002027 

1.49004107536748 

1.48522101377482 

-1.45942422076027 

-1.45905524413008 



P 

0 

0* 

0* 

0 

0 

0 

0 

0 

0 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit 

1593.8 
134.75 



707.85 



gene symbol 
T1MM44 
DIAPH1 • 
MYH11 
CRA 



HLA-DRA 
CD59 

CD81 

FLJ22055 

CD59 

HLA-DRA 

BMI1 

KIAA1247 
SPINT2 
ZNF184 
DF 

CST3 
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BNSDOCID: <WO_ 



_03039443A2_I 



WO 03/039443 



PCT7EP02/ 12303 



complex vs. MLL 
accuracy 
confidence 
gene 

228083_at 

201105_at 

201377_at 

201 358_s_at 

203387_s_at 

201 585_s_at 

222982_x_at 

202746_at 

204951_at 

203725_at 

203544_s_jat 

225804_at 

203386_at 

201359_at 

201830_s_at 

218041_x_at 

22331 8_s_at 

212222_at 

201829_at 

200608 s at 



samples: 36/15 
1 

0.928537448772464 

signal-to-noise p 

-1.83691594955677 0 

-1.56013660814198 0 

1.5390279619994 0 

1.46121373620596 0 

1.43200690176451 0 

1.42612880048733 0 

1.4232823410753 0 

1.40541613814493 0* 

1.38290450200254 0* 

1.35090921185734 0 

1.33195316834084 0 

-1.31098146845679 0 

1.30903268579411 0 

1.3066438308133 0 

1.29170882160348 0 

1.27263528593652 0 

-1.27042416454958 0 

1.26786775770503 0 

1.26235979300155 0 

1.25213416895148 0 



decision limit gene symbol 

LGALS1 

KIAA0144 

COPB 

KIAA0603 

SFPQ 

SLC38A2 



277.15 
224.05 



ARHH 

GADD45A 

STAM 

KIAA0603 

COPB 

NET1 

PRO1068 

MGC10974 

KIAA0077 

NET1 

RAD21 



967 



BNSDOCID: <WO_ 



„03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



normal vs. all other 

accuracy 

confidence 

failed: 

gene 

200023_s_at - HG-U133B 

236892_s_at 

201922_at 

239791_at 

209055_s_at 

228904_at 

224935_at 

236728_at 

238026_at 

200679_x_at 

205366_s_at 

225326_at 

205601_s_jat 

205600_x_at 

230743_at 

218645_at 

226236_at 

200023_s_at - HG-U133A 
222976 s at 



samples: 62/111 

0.895953757225434 

0.738387670770008 

5,17,21,25,32,39,45,50,51,52,54,58,64,70,76,77,86,113 

signal-to-noise P decision limit gene symbol 

0.848290349062723 0 EIF3S5 • 

0.786541449606513 0* 60.95 HOXB6 

0.7461 1 7686034684 0 YR-29 

0.744707047378434 0 HOXB6 

-0.714503324187594 0 CDC5L 

0.707817878234848 0* 407.75 

0.678011793604072 0 EIF2S3 

0.65220413902298 0 

0.649420462340497 0 RPL35A 

-0.648886126347934 0* 782.25 HMG1 

0.642743252402536 0 HOXB6 

0.638282205886151 0* 835.95 KIAA1311 

0.638114898511556 0 HOXB5 

0.637054089932379 0 HOXB5 

0.631769166997722 0 

0.629514230580722 0 ZNF277 

0.621355082335747 0 

0.620074058363248 0 EIF3S5 

-0.616559773343974 0 NTRK1 
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normal vs. t(8;21) 


samples: 62/13 








accuracy 


1 








confidence 










gene 


signal-to-noise 


P 


decision limit 


gene symbol 


228827_at 


-2.03460798747208 


0* 


162.15 




214651_s_at 


1.91662233809597 


0 




HOXA9 


205528_s_at 


-1.64467496425566 


0 




CBFA2T1 


235753_at 


1.61885010435257 


0 






213150_at 


1.50131650688172 


0 




it s~\ yy AAA 

HOXA10 


205529_s_at 


-1.4994140185132 


0 




CBFA2T1 


211341_at 


-1.4972813333234 


0 




POU4F1 


209905_at 


1.49669514190498 


0 




HOXA9 


217816_s_at 


1.49149650897312 


0 




pcnp 


206940_s_at 


-1.48516417693355 


0 




POU4F1 


219598_s_at 


1.43454923845259 


0 






205453_at 


1.38558116676561 


0 




HOXB2 


223498_at 


1.34625406703267 


0 






213844_at 


1.29651280723136 


0 




HOXA5 


213147_at 


1.27938232308768 


0 




HOXA10 


222448_s_at 


1.27922181680816 


0 




UMP-CMPK 


214000_s_at 


-1.2765364955951 


0 




RGS10 


235521_at 


1.27567444648757 


0 




HOXA3 


23581 8_at 


-1.27371437711255 


0 






217963_s_at 


1.27141827915888 


0 




HCS 
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normal vs. t(15;17) 
accuracy 
confidence 
gene 

212953_x_at 

203948_s_at 

214450_at 

38487_at 

224794_s_at 

233072_at 

236787_at 

221004_s_at 

209732_at 

214651_s_at 

216032_s_at 

200952_s_at 

208852_s_at 

200654_at 

204150_at 

235753_at 

64942_at 

203949_at 

20561 4_x_at 



samples: 62 / 20 

1 

1 

signal-to-noise 

-2.64495441949702 

-2.28625389247329 

-2.28053432941651 

-2.27507979049885 

-2.14479331910386 

-2.02319157581908 

-1.97410536661333 

-1.94014484972668 

1.93197336691263 

1.91723603396294 

-1.83899991398145 

-1.79422847402715 

-1.77756000185084 

-1.77272215049273 

-1.76741205974851 

1.73443643867114 

-1.71976456537364 

-1.70342423200997 

-1.65262473388504 





decision limit 


gene symbol 


0* 


4541.1 


CALR 


o 




MPO 


o 




CTSW 


o 




FLJ 12442 


0 




LOC51148 


0 




KIAA1857 


0 






o 




ITM3 


o 




CLECSF2 


0 




HOXA9 


0 




SDBCAG84 


0 




CCND2 


0 




CANX 


0 




P4HB 


0 




STAB1 


0 






0 






0 




MPO 


0 




MST1 



970 



WO 03/039443 
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normal vs. inv(16) 
accuracy 
confidence 
gene 

214651_s_at 

209365_s_at 

200951_s_at 

201497_x_at 

231310_at 

235753_at 

223385_at 

231259_s_at 

213353_at 

202370_s_at 

209905_at 

204661_at 

200675_at 

207194_s_at 

213150_at 

23581 8_at 

228834_at 

201324_at 

218942 at 



samples: 62/12 
1 

0.955620690178973 
signal-to-noise 

1.8118390923002 

-1.79092214915991 

-1.75939101209319 

-1.74852845185764 

-1.60730854736648 

1.60402701006007 

-1.57142154491015 

-1 .49428776750791 

1.47632425092475 

1.47587674217457 

1.44874209939309 

-1.44586168569344 

1.39363453223823 

-1.38591797740996 

1.36428212315189 

-1.35992709972839 

-1.35025928739893 

-1.34841230643998 

1.34162679981249 



P 

0 

0* 

0 

0 

0* 

0 

0 

0 

0 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit 



390.9 



167.8 



942.4 



gene symbol 

HOXA9 

ECM1 

CCND2 

MYH11 



CYP2S1 

CCND2 

ABCA5 

CBFB 

HOXA9 

CDW52 

CD81 

ICAM4 

HOXA10 

TOB1 
EMP1 
FLJ22055 
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BNSDOCID: <WO__ 



_03039443A2J_> 
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PCT/EP02/12303 



normal vs. MLL 

accuracy 

confidence 

failed: 

gene 

205453_at 

225406_at 

222465_at 

225326_at 

200829_x_at 

227786_at 

200056_s_at - HG-U133B 

238856_s_at 

200673_at 

226250_at 

233559_s_at 

227680_at 

228904_at 

225700_at 

202377_at 

229232_at 

231870_s_at 

222982_x_at 

209160_at 

223982 s at 



samples: 62/15 

0.948051948051948 

0.898189333713789 

50,54,68,69 

signal-to-noise 

1.38558116676561 

1.3008944013052 

1.26538127373241 

1.13719416264236 

1.1359357025281 

1.1032082540668 

1.0989163336081 

1 .093395457322 

1.0918171848605 

1.0869927098714 

-1.08564709559314 

1.0800659307262 

1.06689615547167 

1.0577778855845 

1.05091847162674 

1.04560760831211 

1.04167487270266 

1.04035495200911 

1.03517022790078 

1.03308565481985 



P 

0* 

0* 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit 

203.25 

191.3 

1271.25 



gene symbol 
HOXB2 • 
TSG 

C15orf15 

KIAA1311 

ZNF207 

TRAP25 

C1D 

LAPTM4A 
FENS-1 



HSOBRGRP 

LOC51068 
SLC38A2 
AKR1C3 
1PLA2 
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t(8;21) vs. all other 


samples: 13/160 








accuracy 


0.994219653179191 








confidence 


1 








failed: 


30 








gene 


signal-to-noise 


P 


decision limit 


gene symbol 


228827_at 


1.83673626836317 


0 






211341_at 


1.52047414341304 


0 




POU4F1 


205528_s_at 


1.51763257492904 


0 




CBFA2T1 


206940_s_at 


1.51745205298829 


0 




POU4F1 


205529_s_at 


1.43922881664177 


0* 


157.65 


CBFA2T1 


206622_at 


1.22745062154907 


0 




TRH 


219598_s_at 


-1.21843492013845 


0 






214651_s_at 


-1.19784094554498 


0 




HOXA9 


221581_s_at 


-1.18072565374051 


0 




WBSCR5 


227279_at 


-1.10206247133481 


0 




MGC 15737 


20481 1_s_at 


1.07742972237323 


0 




CACNA2D2 


213150_at 


-1.06934246090682 


0 




HOXA10 


223498_at 


-1.06722279372155 


0 






215Q87_at 


-1.-06292558713408 


0 






209522_s_at 


-1.05686457779708 


0 




CRAT 


204495_s_at 


-1.05020858389696 


0 




DKFZP434H1 


34689_at 


-1.04236110585044 


0 




TREX1 


201425_at 


-1.04073518265119 


0 




ALDH2 


22501 0_at 


-1.0307918013289 


0 
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t(8;21) vs. t(15;17) 
accuracy 
confidence 
gene 

214450_at 

38487_at 

209732_at 

204150_at 

201596_x_at 

213944_x_at 

230526_at 

212509_s_at 

211990_at 

20431 9_s_at 

205614_x_at 

216320_x_at 

224794_s_at 

224839_s_at 

227326_at 

238365_s_at 

228827_at 

228570_at 

205349_at 

AFFX-HUMRGE/M1 0098_3_at 
U133B 



samples: 13/20 

1 

1 

signal-to-noise P 

-3.53862069365814 0* 

-3.30334484935728 0 

3.18277220746091 0 

-2.73329912927614 0 

-2.73162867034962 0 

-2.5862372690463 0 

2.42276930706474 0 

-2.33481477262277 0 

2.3270695509372 0 

2.25060861801642 0 

-2.24846130719191 0 

-2.16690689445934 0 

-2.14479331910386 0 

-2.07810412712239 0 

-2.04924504239987 0 

-2.03674279873081 0 

2.03460798747208 0 

-2.03169244854036 0 

-2.02142471684527 0 

HG- 0.424247547600803 0.01 



decision limit 

1809.95 



gene symbol 

CTSW 

FLJ12442. 

CLECSF2 

STAB1 

KRT18 

FLJ20015 

HLA-DPA1 

RGS10 

MST1 

LOC51148 
GPT2 



GNA15 



BNSDOCID: <WO 03039443A2_I_> 
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t(o,21) vs. inv(16) 


rnmnlnc- iO / HO 

samples, i o / iz. 








accuracy 


A 
1 








confidence 










gene 


signal-to-noise 


P 


apA a^. Af% « Z ISm^2£ 

aecision limit 


gene symuoi 


233138_at 


OOl aCOOZ/ 114 


U 


C/l OR 




202283_at 


-2.61 6070/ oSoloo 


U 




otKrlNr i 


201596_x_at 


O CCC-17'lQQr;QQCiC 

-^.oooi ( 1 ooooyoi O 


U 




l\K 1 To 


233555_s_at 


O A QO/IOCyl H oco7no 

-2.4oy4o04iyoo/ Ub 


U 






22681 8_at 


-^oo21o/b/D1 /^o 


U 






212828_at 


-2.331 30605042964 


0 




bYNJ2 


227276_at 


-2.31 1 71 71 8422321 


0 




"1 LKX7D 

TcM/R 


224724_at 


-2.30850265580909 


0 




Kl AA1 247 


224764_at 


-2.259344891 79779 


0 




a our* a oh r\ 

AKHGAP10 


224049_at 


-2. 25244855640038 


0 




KCNK17 


205453_at 


-2.23059959679219 


0 




HOXB2 


226841_at 


-2.21329419316194 


0 






209975_at 


-2.21237181448127 


0 


i 


CYP2E 


205076_s_at 


-2.21083117233863 


0 




CRA 
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Table 36: Analysis of 8 AML subgroups according to the method described by Westfall & young. The 226 significant 
gdnes are listed. The raw p value as well as the adjusted p value are shown. Furthermore it is indicated whether the genes 
were also identified as discriminative by the method described by Golub et al. (* most important gene in pairwise 
comparisons, + one of the top 20 significant genes) 
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Table 37a: in total 32 cases of CLL were analyzed. 31 of 32 cases (96.9%) were assigned to the correct CLL 
subtype in all pairwise comparisons. The sensitivity indicated for each subgroup indicates the percentage of 
cases of the specific subgroup identified correctly in all pairwise comparisons. The specificity indicates for 
each subgroup the percentage of correct assignments to this subgroup. 
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Table 37b: In total 128 individual assignments of CLL were analyzed. 127 of 128 assignments (99.2%) w 
correct. The sensitivity indicated for each subgroup indicates the percentage of correct assignments 
cases of the specific subgroup in pairwise comparisons. The specificity indicates for each subgroup 
percentage of correct assignments to this subgroup. 
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Table 38: Analysis of 5 CLL subtypes according to the method described by Golub etal. 
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231538_at 


-3.6635797888621 1 


0.03 




FLJ23499 


223981_at 


-3.20166631174034 


0.01 




NIN 


210095_s_at 


-3.18292733506553 


0.03 




IGFBP3 


44669_at 


1 .48597424301687 


0.03 
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11q- vs. normal 


samples: 4/9 








accuracy 


1 








confidence 


1 








gene 


signal-to-noise 


P 


aecision limit 


gene syrnuui 


227988_s_at 


*~» fT\ A OA tT A A A tZfX'T A T 

-3. 591 81 5441 5071 7 


0 


AO Q 




202883_s_at 


-3.42911376512134 


U 




PPP9R1R 


239824_s_at 


-2.86491 576775498 


n 
u 






208741 _at 


-2.57358498823744 


r\ 
u 




CADi Q 
Onr I O 


235446_at 


-2.44561 600023ODO 


U 






243024_at 


-2.32550716609078 


0 






214152_at 


-2.281 16624754023 


0 




nipD 


205945_at 


-2 . 250 1 933683546 


0 




II CD 


230405_at 


-2.21 54536025941 


0 




nAncrt 
KAUOU 


240449_at 


-2. 1 6736443161 623 


0 




"7MCTQ/1 -i 

Z.Nr 341 


226735_at 


2.13097198546745 


0 






23571 9_at 


-2.044861 390001 24 


0 






240269_at 


-2.004Q7877559944 


0.02 


1 




231837_at 


-1.99281259915495 


0.01 




USP28 


211584_s_at 


-1.92263813056307 


0 




NPAT 


212397_at 


-1.91363941690428 


0 




RDX 


228083_at 


-1.91262419018279 


0 






217185_s_at 


-1.90105615613598 


0 






201034_at 


-1.87885968131857 


0 




ADD3 


244636_at 


1.41086397558577 


0.01 
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t 



1 3q- vs. all other 

accuracy 

confidence 

failed: 

gene 

240239_at 

225803_at 

21403CLat 

20861 2_at 

205059_s_at 

224324_at 

220050_at 

212133_at 

209561_at 

212346_s_at 

210563_x_at 

219846_at 

20091 7_s_at 

214693_x_at 

224254_x_at 

216363_at 

206178_at 

201829_at 

242065 x at 



samples: 10/22 
0.90625 

0.758934573190992 
1,4,10 

signal-to-noise 

1.12265317688576 

-1.11955038091099 

-1.07932639824253 

-1.04692103821733 

-1.02298997137232 

-1.0017529706033 

-0.942587733303296 

-0.939561276009625 

-0.895424188378393 

-0.86343267965412 

-0.857353032891431 

0.851391806455573 

-0.847094054810162 

-0.842327739447741 

-0.840493461837671 

0.840371719572254 

-0.839970145359486 

-0.83771452268015 

0.752061453727533 



P 

0 
0 
0 
0 
0 

0.01 

0 

0 

0.01* 1 

0.01* 1 

0 

0 

0 

0* 51 3.4 
0* 1 
0 

0.01* 1 

0 

0 



decision limit 



gene symbol 

FLJ 14779 

FBX032 

FLJ14393 

GRP58 

IDUA 

B29 

C9orf9 

MGC5466 

THBS3 

CFLAR 

FU23040 

SRPR 

DJ328E19.C1.1 



PLA2G5 

NET1 

KIAA0982 
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13q- vs.17p- 
accuracy 
confidence 
gene 

205074_at 

205059_s_at 

206530_at 

239302_s_at 

211097_s_at 

213922_at 

206748_s_at 

230960_at 

219099_at 

209399_at 

234682_at 

201247_at 

227047_x_at 

227336_at 

44563_at 

202137_s_at 

204703_at 

232841 _at 



samples: 10/4 

1 

1 

signal-to-noise 

3.93082740953048 

-2.86213992210757 

-2.83791228863181 

2.16473291745382 

2.10829683608662 

2.01548216857792 

-1 .88775766295876 

1.78981943765595 

-1.78601869648563 

1.77172423634645 

-1.75764919544644 

-1.75383479254057 

1.70965672311403 

-1.67331959955983 

1 .6395923566301 

1 .63835350305949 

1.63405057461184 

1.61894883955233 



P 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0.02 

0 

0 

0 

0 

0 

0 

0 



decision limit 

47.2 



gene symbol 

SLC22A5 

IDUA 

RAB30 

PBX2 

KIAA0847 

SPAG9 

C12orf5 
HLCS 

SREBF2 

KIAA1538 

DTX1 

FLJ 10385 

BS69 

TG737 



1001 



BNSDOCID: <WO_ 



_03039443A2_I_> 
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13q- vs. normal 

accuracy 

confidence 

failed: 

gene 

212133_at 

231945_at 

236707_at 

240113_at 

233137_at 

239279_at 

217920_at 

220050_at 

214693_x_at 

240239_at 

206633_at 

233845_at 

214030_at 

214452_at 

244636_at 

22971 1_s_at 

20861 2_at 

201829_at 

225025_at 

5691 9_at 



samples: 10/9 
0.947368421052632 
0.767885192437171 
1 

signal-to-noise p 

-1.49179585241618 0 

1.34403975576724 0 

-1.3127085187848 0 

1 .2998845409231 4 0.01 

-1.24704962815733 0* 

1.17173879184892 0* 

-1.16948313904991 0* 

-1.1554548232117 0.01 

-1.12710287831488 0* 

1.12265317688576 0 

1.11411516845923 0 

1.10899780853203 0 

-1.10624113689033 0 

-1.10585601374159 0.01 
1.09426538931965 . 0 

-1.08284772025586 0* 

-1.04802029197291 0 

-1.04541705234559 0 

-1 . 04209808409802 0 

0.843685137333952 0 



decision limit gene symbol 
MGC5466 
KIAA1275 
DAPP1 

72.3 

10.95 

100.65 

C9orf9 

785.75 DJ328E19.C1.1 
FLJ 14779 
CHRNA1 

FLJ14393 
BCAT1 

175.2 MGC5370 
GRP58 
NET1 
IGSF8 
KIAA1449 
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i 

17p- vs. all other 
accuracy 
confidence 
gene 

213922_at 

239302_s_at 

210807_s_at 

227047_x_at 

243493_at 

212601_at 

231069_at 

228183_s_at 

218938__at 

209754_s_at 

229487_at 

202101_s_at 

204075_s_at 

225064_at 

211800_s_at 

44563_at 

209003_at 

233261 _at 



samples: 4 / 28 
1 

0.964568811002079 

signal-to-noise p 

-2.38687598396346 0 

-1.90616910234578 0 

-1.80157318540302 0* 

-1.74312208164522 0* 

-1 .5834922051 21 96 0.01 * 

-1.53446329571692 0 

-1.52786491236881 0 

-1.52482033281554 0 

-1.52269705250236 0 

-1.51357080613702 0 

1.48496465895187 0.01 

-1.48060713997599 0 

-1.44015391162326 0 

-1.43255380391079 0 

-1.42133619880381 0 

-1.41946867497272 0 

-1 .41 936503927243 0.01 

1.41906569641121 0 



decision limit gene symbol 
KIAA0847 

1 SLC16A7 
1 KIAA1538 
1 

KIAA0399 

MGC4189 
MGC 11279 



RALB 
KIAA0562 

USP4 

FLJ10385 

SLC25A11 
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17p- vs. normal 
accuracy 
confidence 
gene 

239263_at 

224389_s_at 

218938_at 

209558_s_at 

209003_jat 

213922_at 

227277_at 

201247_at 

239824_s_at 

243493_at 

202101_s_at 

238208_at 

232641_at 

38340_at 

225565_at 

235692_at 

212587_s_at 

209178_at 

226101 at 



samples: 4/9 

1 

1 

signal-to-noise 

-5.06986350689012 

4.85220428207756 

-4.37035592768397 

3.96926728013304 

-3.83037277930439 

-3.70320846722859 

-3.14601684537179 

3.13384207920433 

-2.86491576775498 

-2.55548617887833 

-2.51633093119791 

2.48580500500254 

-2.43239639363709 

2.39155889740356 

-2.36688673011482 

-2.35357613574803 

-2.31297420341289 

2.30977713582653 

-2.30435376005559 



P 

0* 

0.01 

0 

0.01 
0 
0 
0 

0.01 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit 

28.95 



gene symbol 

LOC84570 • 

MGC11279 

HIP12 

SLC25A11 

KIAA0847 

SREBF2 
MGC10744 

RALB 



HIP12 



PTPRC 
DDX38 



1004 
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normal vs. all other 
accuracy 
confidence 
gene 

217920_at 

233845_at 

239279_at 

231945_at 

240113_at 

219641_at 

219457_s_at 

211349_at 

228408_s_at 

214949_at 

235052_at 

212417_at 

244636_at 

226060_at 

239263_at 

200890_s_at 

208042_at 

206550_s_at 

201813_s_at 

221543_s_at 



samples: 9/23 
1 

0.752908435315332 

signal-to-noise p 

1.12626688471637 0* 

-1.11422356526216 0 

-1.07195619687299 0* 

-1.04730583025734 0* 

-1.03819026438649 0* 

1.01342862157126 0* 

1.01203293419066 0* 

-0.983993758419454 0 

0.980977651352115 0* 

0.9706833699361 25 0 

0.970295482117812 0* 

0. 947300555400096 0 

-0.940508403486097 0.01 

0.905224209711651 0 

0.904436494271519 0.01 

0.896568314554417 0 

0.88315849081967 0 

-0.869633758548337 0 

0.854527572567661 0 

0.842343687568175 0 



decision limit gene symbol 
120,8 

1 

47.05 KIAA1275 
1 

68.05 FLJ10103 

653.6 RIN3 
SLC15A1 

301.7 FLJ10498 
18.2 

SCAMP 1 

RFT1 

SSR1 

HSU84971 

NUP155 

KIAA0210 

C8orf2 
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Table 39: Analysis of 5 CLL subgroups according to the method described by Westfall & Young. The raw p 
value as well as the adjusted p value are shown. Furthermore it is indicated whether the genes were also 
identified as discriminative by the method described by Golub et al. (* most important gene in pairwise 
comparisons, + one of the top 20 significant genes) 





Golub 


rawo adip 




99Q487 at 




0.0016 


0.4057 


^jO^u I at 


+ 


0.0016 


0.6404 






0.0003 


0.7273 


94487ft at 




0.0016 


0.831 


OH 01 00 a* 


+ 


0.0001 


0.8508 MGC5466 


009904. flt 




0.0008 


0.889 EBF 


9.4/1 i 47 at 




0.0014 


0.9084 


940740 flt 




0.0016 


0.926 


99TC97 at 




0.0001 


0.943 


908^1 Q v at 




0.0007 


0.9529 SIGLEC6 


9«ioq99 of 
Z i 05?ZZ — _dl 


+ 


0.0001 


0.9568 KIAA0847 


907801 q at 




0.0011 


0.9745 RNF10 


919819 at 




0.0003 


0.9763 


9/I909Q 




0.0016 


0.9777 


991004 s at 




0.0012 


0.9807 ITM3 


90Q^91 at 




0.0005 


0.9837 ADCY3 


44^8°, flt 




0.0003 


0.9839 FLJ10385 


91981°, at 




0.0013 


0.984 FLJ14529 


217156 at 




0.0012 


0.9862 


919099 at 


+ 


0.0004 


0;9864 C12orf5 


9°,R°.85 s at 




0.0004 


0.988 


919846 at 


* 


0.0003 


0.9957 FLJ23040 


227936_at 




0.0001 


0.9975 


228497_at 




0.0002 


0.9992 DKFZp761G0313 


244523_at 




0.0007 


0.9992 MMD 


222108_at 




0.0008 


0.9994 


212601_at 


+ 


0.0003 


0.9996 KIAA0399 


22231 3_at 




0.0015 


0.9996 


204208_at 




0.0016 


0.9997 RNGTT 


210916_s_at 




0.0009 


0.9997 


224872_at 




0:0006 


0.9997 KIAA1463 


241930_x_at 


* 


0.0006 


0.9997 


202530_at 




0.0007 


0.9998 MAPK14 


203026_at 




0.0004 


0.9998 KIAA0354 


203394_s_at 




0.001 


0.9998 HRY 
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21 5227_x_at 0.0002 0.9998 ACP1 

229138 at 0.0012 0.9998 



1007 
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Table 40a: In total 280 cases of leukemia and normal bone marrow (BM) were analyzed. 263 of 280 cases 
(93.9%) were assigned to the correct leukemia subtype in all pairwise comparisons. The sensitivity indicated 
for each subgroup indicates the percentage of cases of the specific subgroup identified correctly in all 
pairwise comparisons. The specificity indicates for each subgroup the percentage of correct assignments to 
5 this subgroup. 





ALL 


AML 


CLL 


CML 


normal 
BM 


total 


sensitivity 

% 


specificity 

% 


ALL 


36 

' i 


9 


1 


1 




47 


76.60 


94.74 


AML 


2 


■ 

. ' 1?2 

\ % ' % 






1 


175 


98.29 


93.48 


CLL 




1 


Vi/V' 34 
i 






35 


97.14 


97.14 


CML 




1 




13 




14 


92.86 


92.86 


normal 
BM 




1 






< 

8 


9 


88.89 


88.89 


total 


38 


184 


35 


14 


9 


280 
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Table 40b: In total 1120 individual assignments of leukemia were analyzed. 1103 of 1120 assignments (98.5%) 
were correct. The sensitivity indicated for each subgroup indicates the percentage of correct assignments 
for cases of the specific subgroup in pairwise comparisons. The specificity indicates for each subgroup the 
percentage of correct assignments to this subgroup. 





ALL 


AML 


CLL 


CML 


normal 
BM 


total 


sensitivity 

% 


specificity 

% 


ALL 




9 


1 


1 




188 


94.15 


98.88 


AML 


2 


v ; 6 98 






1 


700 


99.71 


98.31 


CLL 




1 


<P 1 39 

fi« mi 






140 


99.29 


99.29 


CML 




1 








56 


98.21 


98.21 


normal 
BM 




1 








36 


97.22 


97.22 


total 


179 


710 


140 


56 


36 


1120 
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Table 41: Analysis of 4 leukemia subtypes and normal bone marrow according to the method described by 
Golub et al. 



ALL ( 47 

AML 175 

CLL 35 

CML 14 

normal BM 9 



ALL vs. all other 
accuracy 
confidence 
failed: 

gene 

214761_at 

219013_at 

243362_s_at 

229487_at 

22491 8_x_at 

231736_x_at 

203373_at 

201540_at 

218589_at 

210487_at 

219753_at 

223703_at 

203372_s_at 

243363_at 

208248_x_at 

208702_x_at 

200661_at 

34726_at 

203795_s_at 

230292 at 



samples: 47 / 233 

0.960714285714286 

0.912089069819735 

11,12,13,14,16,20,24,41,42,45 

,46 

signal-to-noise p 

0.970557084257439 0 

-0.89849634331 4981 0* 

0.885999523838621 0* 

0.879786130972461 0 

-0.87724222950274 0 

-0.869791217148163 0 

0.858710621146821 0 

0.85206943650763 0 
0.849690653983403 * 0 

0.84637653892686 0 

0.825247459991453 0 

-0.822007469078868 0 

0.821441808884396 0 

0.820493794299965 0 

-0.81 29401 51 824066 0 

-0.8015522739222 0 

-0.791253715797081 0 

0.790007042454156 0 

0.78850089331931 0 

0.788256600173715 0 



decision limit 

66.1 
72.85 



gene symbol 
OAZ 

FLJ21634 
LEF1 

MGST1 

MGST1 

STATI2 

FHL1 

P2Y5 

DNTT 

STAG3 

CDA017 

STATI2 

LEF1 

APLP2 

APLP2 

PPGB 

CACNB3 

BCL7A 
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ALL vs. AML 




samples: 47/175 








accuracy 




0.95045045045045 








confidence 




0.894203385037297 








failed: 




1 1 , 1 2, 1 4, 1 5, 1 6,24,27,36,42,67 
' ,115 








gene 




signal-to-noise 


P 


decision limit 


gene symbol 


221558_s_at 




1.12614883575596 


0 




LEF1 


22491 8_x_at 




-1.11745973177444 


0 




MGST1 


41220_at 




1.1150066427645 


0 




MSF 


231736_x_at 




-1.10060017375911 


0 




MGST1 


205382_s_at 




-1.01589067961124 


0 




DF 


200661 _at 




-1.01253920985203 


0 




PPGB 


208702_x_at 




-1.00158300302362 


0 




APLP2 


223703_at 




-0.995387894243593 


0 




CDA017 


211404_s_at 




-0.985001465936684 


0 




APLP2 


208248_x_at 




-0.981211344877388 


0 




APLP2 


210948_s_at 




0.973218284228957 


0 




LEF1 


243363_at 




0.96583392519617 


0 




LEF1 


214875_x_at 




-0.960268914986593 


0 




APLP2 


214761_at 




0.959842337636482 


0 




OAZ 


219013_at 




-0.956595406819358 


0 




FLJ21634 


243362_s_at 




0.953556737247221 


0 




LEF1 


204215_at 




0.94873841769727 


0* 


886.4 


MGC4175 


217989_at 




-0.944775256168326 


0 




LOC51170 


203041 _s_at 




-0.944024039515523 


0 




LAMP2 


AFFX-r2-Bs-dap-3_at 


- HG- 0.252552477574741 


0 







U133A 
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ALL vs. CLL 

accuracy 

confidence 

failed: 

gene 

225927_at 

224838_at 

239287_at 

202625_at 

208091_s_at 

20451 1_at 

AFFX- 

HUMGAPDH/M331 97__3_at 

-HG-U133B 

201417_at 

212914_at 

20761 6_s_at 

223391_at 

201416_at 

44790_s_at 

201462_at 

205997_at 

219471_at 

228390_at 

206337_at 

218191_s_at 

AFFX- 

HUMGAPDH/M33197_M_at 
-HG-U133A 



samples: 47 / 35 

0.98780487804878 

1 

42 

signal-to-hoise 

-2.54386605255744 
-2.29762422456477 
-2.24993335468714 
-2.06005808097774 
-2.03821117161393 
-2.00374139556454 
1.9580832868002 



1.9394827392449 

-1.89064460393095 

-1.88874561163207 

-1.87749885738082 

1.87582200205394 

-1.85742303821942 

-1.84559373056342 

-1 .82244878004852 

-1.82048094941098 

-1.81369080029951 

-1.81234629147377 

-1.80317310126938 

1.47237903994758 



P 

0* 

0 

0 

0 

0 

0 

0 



0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 



decision limit 

1826.45 



gene symbol 
MAP3K1 



LYN 

DKFZP564K0822 

KIAA0793 

GAPD 



PKP4 

TANK 

LOC81537 

SOX4 

FLJ21562 

KIAA0193 

ADAM28 

FLJ21562 

CCR7 
FLJ 11240 
GAPD 



1012 
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_03039443A2J_> 



WO 03/039443 



PCT/EP02/12303 



ALL vs. CML 

accuracy 

confidence 

failed: 

gene 

206440_at 

210254_at 

205557_jat 

200654_at 

206676_at 

206111_at 

212268_at 

214317_x_at 

211275_s_at 

212531_at 

206207_at 

203757_s_at 

204174_at 

207802_at 

211657_at 

213572_s_at 

203949_at 

204351_at 

229790_at 

231854 at 



samples: 47/14 
0.983606557377049 



1 

14 

' signal-to-noise p 

-2.27699665329749 0 

-2.24512948903079 0 

-2.24379338781824 0* 

-2.2057677822486 0 

-2.19594220925272 0 

-2.12209587819278 0 

-2.06787563904532 0 

2.05505121682362 0 

-2.0481841650404 0 

-1.99958187859813 0 

-1.98193517857499 0 

-1.92761412477345 0 

-1.91747003112972 0 

-1.90928433527318 0 

-1.8921241480708 0 

-1.87212093424446 0 

-1.85990729100683 0 

-1.8590267215838 0 

1.85536512952314 0 

1.67066719911877 0 



decision limit gene symbol 
VELI1 

8347.8 BPI 
P4HB 
CEACAM8 
RNASE2 
SERPINB1 
RPS9 
GYG 
LCN2 
CLC 

CEACAM6 
ALOX5AP 
SGP28 

SERPINB1 
MPO 
S100P 
TERF2 



BNSDOCID: <WO 03039443A2_I_> 



1013 



WO 03/039443 



PCT/EP02/12303 



ALL vs. normal BM 
accuracy 
confidence 
gene 

203645_s_at 

224976_at 

224975_at 

218424_s_at 

223044_at 

230988_at 

225792_at 

223280_x_at 

224970_at 

215049_x_at 

224356_x_at 

218916_at 

201506_at 

202443_x_at 

226751_at 

218516_s_at 

201416_at 

226448_at 

221731_x_at 

40189 at 



samples: 47/9 
1 

0.957516862739848 



signal-to-noise p 

-2.39693596113584 0 

-2.31479350464798 0 

-2.18458278616315 0 

-2.15949227895441 0 

-2.14051279056572 0 

-2.14049582400222 0 

-2.11616520995308 0 

-2.05336993376349 0* 

-2.02013541876615 0 

-1.97618634292152 0 

-1.93812195252731 0* 

1 .86645836675574 0* 

-1.80048485113822 0 

-1.76350311549162 0 

-1.75212231661127 0 

1.74956324093924 0 

1 .74432934603701 0 

-1.73849176051715 0 

-1.73207109032378 0 

1.37012925113272 0 



decision limit gene symbol 
CD163 
NFIA 
NFIA 
FLJ10829 
SLC11A3 



2365.65 MS4A6A 
NFIA 
CD163 

2203.55 MS4A6A 
26.2 FLJ23436 
TGFBI 
NOTCH2 
DKFZP566K1924 
FLJ20421 
SOX4 

CSPG2 
SET 



BNSDOCID: <WO 03039443A2J_> 



1014 



WO 03/039443 



PCT/EP02/ 12303 



AML vs. all other 
accuracy 
confidence 
failed: 



gene 

205382_s_at 

221558_s_at 

233177_s__at 

221969_at 

206398_s_at 

41220_at 

200661_at 

203005_at 

211404_s_at 

208702_x_at 

210948_s_at 

221739_at 

206255_at 

205049_s_at 

231736_x_at 

214875_x_at 

241353_s__at 

215785_s_jat 

242774_at 

201200_at 



samples: 175/105 

0.882142857142857 

0.867062232379983 

1 7,20, 35,51 ,64,68,86, 1 43, 1 45, 1 89, 1 91 , 1 99,21 7,253,258,259,260, 
261 ,262,263,264,265,266,267,268,269,270,271 ,272,273,274,275, 
280 

p decision limit 

0 
0 
0 

0* 607.75 
0* 365.2 
0 



signal-to-noise 

0.993761979191943 
-0.953118583183064 
0.898951559660059 
-0.897763040671617 
-0.887949095602823 
-0.874923766956177 
0.864835685361425 
0.859889652633032 
0.848185516601275 
0.844435105361934 
-0.841151060868526 
0.821598909550794 
-0.821040810957811 
-0.820101703791881 
0.818968232559464 
0.818667567167621 
0.816044719173918 
-0.815070063021364 
-0.813216814821168 
0.812769337801728 



0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 



gene symbol 
DF 
LEF1 

KIAA1184 
PAX5 
CD19 
MSF 
PPGB 
LTBR 
APLP2 
APLP2 
LEF1 
IL27 
BLK 
CD79A 
MGST1 
APLP2 

CYFIP2 
SYNE-2 
CREG 



BNSDOCID: <WO 03039443A2_I_> 



1015 



WO 03/039443 — PCT/EP02/12303 



AML vs. CLL 

accuracy 

confidence 

failed: 

gene 

224838_at 

243780_at 

22351 4_at 

212827_at 

239287_at 

44790_s_at 

228390_at 

229072_at 

224837_at 

223391_at 

219471_at 

208456_s_at 

201998_at 

223287_s_at 

212590__at 

229844_at 

236280_at 

230768_at 

208091 _s_at 

208864_s_at 



samples: 175/35 
0.995238095238095 

1 

206 

signal-to-rioise 

-3.02914553670118 

-2.47595031535238 

-2.32616269321224 

-2.28998110195367 

-2.23009354706297 

-2.21489306728323 

-2.19221126790713 

-2.17738285030653 

-2.17198876436932 

-2.16212129696244 

-2.15559694298803 

-2.12547066951316 

-2.10911401275474 

-2.10564435434002 

-2.10270770151805 

-2.06808415651229 

-2.03990689910927 

-2.02994432200127 

-2.02423665232113 

2.01877334646697 



P 

0 

0 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit gene symbol 



401.75 



CARD1 1 
IGHM 

FLJ21562 



FOXP1 

LOC81537 

FLJ21562 

RRAS2 

SIAT1 

FOXP1 



DKF2P564K0822 
TXN 



BNSDOCID: <WO 03039443A2_I_> 



1016 



WO 03/039443 



PCT/EP02/12303 



AML vs. CML 

accuracy 

confidence 

failed: 

gene 

212531_at 

205557_at 

207802_at 

209772_s_at 

206676_at 

210244_at 

216379_x_at 

209771 _x_at 

203936_s_at 

20201 8_s_at 

201029_s_at 

223894_s_at 

207269_at 

203757_s_at 

205863_at 

211657_at 

20551 3_at 

223839_s_at 

210254_at 

39835 at 



samples: 175/14 
0.994708994708995 
0.981183073260496 
180 

' signal-to-noise p 

-2.81906069321688 0 

-2.2207882876404 0 

-2.21 833080555492 0* 

-2.16935748435446 0* 

-2.13938671438891 0* 

-2.09652835954526 0 

-2.08891207162947 0 

-2.07371303677609 0 

-2.04218257187719 0* 

-1.88089473759452 0 

1.75719687256454 0* 

-1 .74095406790845 0* 

-1.71722340830381 0* 

-1.70294141320522 0 

-1.70288697664328 0 

-1.69693858255424 0 

-1.64900632810016 0 

-1.60135318115808 0 

-1.56841234039023 0 

1.23907392901905 0 



decision limit 



3039.3 
3429.4 
7865.1 



7297.15 

1592.95 

455.2 

15756.25 



gene symbol 

LCN2 
BPI 

SGP28 
CD24 
CEACAM8 
CAMP 

CD24 

MMP9 

LTF 

MIC2 

FTS 

DEFA4 

CEACAM6 

S100A12 

TCN1 



SBF1 



BNSDOCJD: <WO 03039443A2_I_> 



1017 



WO 03/039443 



PCT/EP02/12303 



AML vs. normal BM 

accuracy 

confidence 

failed: 

gene 

212531_at 

210244_at 

20201 8_s_at 

231241_at 

218516_s_at 

225792_at 

228377_at 

244652_at 

225700_at 

201396_s_at 

207802_at 

208141_s_at 

207269_at 

200631_s_at 

208651_x_at 

227151_at 

210004_at 

203367_at 

218916_jat 

40189 at 



samples: 175/9 

0.989130434782609 

1 

48,179 

signal-to-noise 

-2.40063827757397 

-2.19732582937896 

-2.14183206922033 

-2.03350599733901 

1.74555082511394 

-1.63422305998419 

-1.59923589910665 

-1.53426287521002 

-1.53281555961668 

1 .46522297092893 

-1.44831955629631 

1.43174521272245 

-1.42268720126305 

1.40042176845386 

-1.39706589332311 

1.39359540599557 

-1.38874742675296 

1.38466765321816 

1.37378311700267 

1.35679082157773 



P 

0 

0 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit 



12387 



gene symbol 
LCN2 • 
CAMP 
LTF 

FLJ20421 
KIAA1384 



SGT 

SGP28 

MGC4293 

DEFA4 

SET 

CD24 

OLR1 
DUSP14 
FLJ23436 
SET 



BNSDOCID: <WO 03039443A2J_> 



1018 



WO 03/039443 



PCT/EP02/12303 



OLL vs. all other 

accuracy 

confidence 

failed: 

gene 

224838_jat 

239287_at 

22351 4_at 

44790_s_at 

228390_at 

223391_at 

20809 1_s_at 

219471_at 

229072_at 

243780_at 

223287_s_at 

212590_at 

229844_at 

208456_s_at 

201998_at 

225927_at 

202589_at 

230768_at 

224837_at 

214615 at 



samples: 35 / 245 

0.996428571428571 

1 

31 

signal-to-noise 

2.79416338027605 

2.23575554655226 

2.14355675231863 

2.09543551175381 

2.08427210739342 

2.06238950802763 

2.03394814388952 

2.03307120566838 

2.02056757302765 

2.01909938888062 

2.00808281461583 

1.988255009037 

1.97715993452055 

1.97615733325 

1.97574573382318 

1.95021458529306 

-1.92898152025988 

1.92227084324811 

1.91127793077848 

1.91102072539205 



P 

0 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit 

630.55 



gene symbol 



CARD 11 
FU21562 

LOC81537 

DKFZP564K0822 

FU21562 



FOXP1 



RRAS2 
SIAT1 
MAP3K1 
TYMS 

FOXP1 
P2Y10 



1019 



BNSDOCID: <WO_ 



_03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



CLL vs. CML 
accuracy 
confidence 
gene 

210254_at 

205557_at 

206111_at 

206871_at 

202503_s_at 

20961 9_at 

203949_at 

202589_at 

212268_at 

212531_at 

206676_at 

210334_x_at 

204670_x_at 

224838_at 

208306_x_at 

212750_at 

200654_at 

213572_s_at 

207269_at 

41577 at 



samples: 35/14 

1 

1 

signal-to-noise 

-4.916136683483 

-4.69748391957959 

-4.58827527010058 

-4.44599783341785 

-4.43537957592345 

4.43385248113705 

-4.37548189755698 

-4.23719643920935 

-4.17631489395545 

-4.14992393038831 

-4.00803780946535 

-3.95114669210453 

3.87175276162045 

3.70493413772712 

3.70225355884714 

3.65712314278563 

-3.65009107919176 

-3.6081864481715 

-3.56289987570257 

3.23509608373299 



P 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit 

3718.45 



gene symbol 
BPI 

RNASE2 

ELA2 

KIAA0101 

CD74 

MPO 

TYMS 

SERPINB1 

LCN2 

CEACAM8 

BIRC5 

HLA-DRB5 

HLA-DRB4 

PPP1R16B 

P4HB 

SERPINB1 

DEFA4 

PPP1R16B 



1020 



RNSDOCID: <WO_ 



_03039443A2_I_> 



WO 03/039443 



PCT/EP02/12303 



CLL vs. normal BM 
accuracy 
confidence 
gene 

210613_s_at 

204776_at 

218662_s_at 

205051_s_at 

202503_s_at 

202580_x_at 

20201 8_s_at 

206871 _at 

230988_at 

224975_at 

227230_s_at 

20971 4_s_at 

212531_at 

214575_s_at 

223785_at 

224976_at 

218424_s_at 

202705_at 

202589 at 



samples: 35/9 
1 

t 1 

signal-to-noise 

-4.82300971873091 
-4.80025506569938 
-4.51907004754891 
-4.20900192886026 
-4.20172034150208 
-4.18939678293477 
-4.17647701264208 
-4.05443661323157 
-4.04866661866689 
-3.82592750915745 
-3.69195069014623 
-3.58566958034221 
-3.57925128815766 
-3.54194227378832 
-3.42904648124574 
-3.40063969780409 
-3.38874340856322 
-3.38096906634746 
-3.36383889155021 



P 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit 

459.4 



gene symbol 

SYNGR1 

THBS4 

HCAP-G 

KIT 

KIAA0101 
FOXM1 
LTF 
ELA2 

NFIA 

KIAA1211 

CDKN3 

LCN2 

AZU1 

FU10719 

NFIA 

FLJ10829 

CCNB2 

TYMS 



1021 



BNSDOCID: <WO_ 



_03039443A2_L> 



WO 03/039443 



PCT/EP02/12303 



CML vs. al! other 

accuracy 

confidence 

failed: 

gene 

212531_at 

205557_at 

206676_at 

209772_s_at 

207802_at 

216379_x_at 

209771 _x_at 

203936_s_at 

211657_at 

203757_s_at 

210254_at 

20201 8_s_at 

211275_s_at 

20551 3_at 

205863_at 

223839_s_at 

206440_at 

210244_at 

207269_at 

204174_at 



samples: 14/266 
0.982142857142857 
0.984872053615011 
1,5,7,11,28 

signal-to-noise p 

2.22285326348413 0 

2.21595356438375 0* 

2.03792028235761 0 

1.92045186041392 0* 

1.85737448791579 0 

1.78690596201348 0 

1 .78099473628645 0 

1.76415242821485 0 

1.70950599523981 0 

1.69567517095366 0 

1.67886160490067 0 

1.66318733776484 0 

1.64178539863162 0 

1.62356154923572 0 

1.61881375489405 0 

1.58264885795695 0 

1.58048354996482 0 

1.57522288410986 0 

1.5628690030838 0 
1.55441937407586 • 0 



decision limit gene symbol * 
LCN2 

11581.65 BPI 

CEACAM8 

3429.4 CD24 
SGP28 

CD24 
MMP9 

CEACAM6 

LTF 
GYG 
TCN1 
S100A12 

VELI1 
CAMP 
DEFA4 
ALOX5AP 



BNSDOCID: <WO 03039443A2J_> 



1022 



WO 03/039443 



PCT/EP02/12303 



OML vs. normal BM 
accuracy 
confidence 
gene 

227198_at 

204562_at 

227749_at 

228377_at 

22671 3_at 

20961 9_at 

227375_at 

204057_at 

201506_at 

221558_s_at 

227867_at 

225331 _at 

223280_x_at 

243780_at 

228055_at 

204670_x_at 

208683_at 

210356_x_at 

209670_at 

AFFX- 

HUMGAPDH/M33197_3_ 
- HG-U133B 



samples: 14/9 

1 

1 

signal-to-noise 

-3.96995770729437 

-3.83910818190111 

-3.11980636745679 

-2.65145492785893 

-2.28229285614419 

-2.27384420268273 

-2.221 84735546056 

-2.13119125485665 

-2.05058862618543 

-2.0055054625083 

-2.00414646646454 

-1.95134035825926 

-1.95125537387335 

-1.94601159348593 

-1.94418493221925 

-1.94197079510435 

-1.93662356244796 

-1.92014292171279 

-1.89146017871026 

1.0887744230798 



P 

0* 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



decision limit 

39.95 



gene symbol 
IRF4 

KIAA1384 
CD74 

DKFZP566D1346 

ICSBP1 

TGFBI 

LEF1 

MS4A6A 



HLA-DRB5 

CAPN2 

MS4A2 

TRA 

GAPD 



at 



1023 



BNSDOCID: <WO_ 



_03039443A2_L> 



WO 03/039443 



PCT/EP02/12303 



normal BM vs. all other 


samples: 9/271 






accuracy 


0.975 






confidence 


0.993135080527801 






failed: 


1,3,4,5,6,8,9 






gene 


signal-to-noise 


P 


decision limit gene symbol 


231241_at 


1.74993445464942 


0* 


145.7 


20201 8_s_at 


1.72201479004034 


0 


LTF 


2l2531_at 


1.63441866839502 


0 


LCN2 


210244_at 


1.48912390888008 


0 


CAMP 


244652_at 


1.41243941049484 


0 




224976_at 


1.41100451645958 


0 


NFIA 


218916_at 


-1.39948677493198 


0 


FLJ23436 


218516_s_at 


-1.39516117721544 


0 


FLJ20421 


208141_s_at 


-1.38238694283986 


0 


MGC4293 


201396_s_at 


-1.3585420884411 


0 


SGT 


230988_at 


1.30213323583922 


0 




20063 1_s_at 


-1 .2906273376207 


0 


SET 


224975_at 


1.28440445542309 


0 


NFIA 


227497_at 


1.2690169947779 


0* 


317.95 


203535_at 


1.26454687252909 


0 


S100A9 


203367_at 


. -1.25252661696593 


0 


DUSP14 


40189_at 


-1.24465725545763 


0 


SET 


223785_at 


1.23866083411764 


0 


FLJ10719 


225700_at 


1.22842543098158 


0 




207269 at 


1.22161055733689 


0 


DEFA4 



BNSDOCID: <WO 03039443A2_I_> 



1024 



WO 03/039443 



PCT/EP02/12303 



Table 42: Analysis of 4 leukemia subgroups and normal bone marrow (BM) according to the method 
described by WestfalK & Young. The 499 significant genes are listed. The raw p value as well as the adjusted 
p value are shown. Furthermore it is indicated whether the genes were also identified as discriminative by 
the method described by Golub et al. (* most important gene in pairwise comparisons, + one of the top 20 
5 significant genes) 
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0.0001 


0.0001 




229934_at 


0.0001 


0.0001 




230110_at 


0.0001 


0.0001 




230245_s_at 


0.0001 


0.0001 




230363_s_at 


0.0001 


0.0001 


SAC2 


230489_at 


0.0001 


0.0001 


CDS 


230499_at 


0.0001 


0.0001 




230551_at 


0.0001 


0.0001 




230648_at 


0.0001 


0.0001 




230689_at 


0.0001 


0.0001 




230753_at 


0.0001 


0.0001 




230768_at 


+ 0.0001 


0.0001 




230793_at 


0.0001 


0.0001 


FLJ20048 


230802_at 


0.0001 


0.0001 


DKF2P564B1 


230803_s_at 


0.0001 


0.0001 


DKFZP564B1 


230834_at 


0.0001 


0.0001 




230877_at 


0.0001 


0.0001 


IGHG3 


230917_at 


0.0001 


0.0001 




230939_at 


0.0001 


0.0001 




230983_at 


0.0001 


0.0001 




230986_at 


0.0001 


0.0001 




230997_at 


0.0001 


0.0001 




231093_at 


0.0001 


0.0001 


FCRH3 


231332_at 


0.0001 


0.0001 




231406_at 


0.0001 


0.0001 




231418_at 


0.0001 


0.0001 


MS4A2 


231647_s_at 


0.0001 


0.0001 


1RTA2 


231656_x_at 


0.0001 


0.0001 


OSBPL10 


231775_at 


0.0001 


0.0001 


TNFRSF10A 


231793_s_at 


0.0001 


0.0001 


CAMK2D 


231794_at 


0.0001 


0.0001 


CTLA4 


231873_at 


0.0001 


0.0001 




232027_at 


0.0001 


0.0001 




232060_at 


0.0001 


0.0001 




232112_at 


0.0001 


0.0001 




232125_at 


0.0001 


0.0001 
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23221 0_at 

2^261 4_at 

232739_at 

232950_s_at 

233483_at 

233955_x_at 

234140_s_at 

234362_s_at 

234725_s_at 

235023_at 

235051_at 

235061_at 

235331_x_at 

235353_at 

235372_at 

235385_at 

235400_at 

235401_s_at 

235422_at 

235444_at 

235459_at 

235674_at 

235982__at 

236190_at 

236226_at 

236248_x_at 

236265_at 

236280_at 

236293_at 

236301_at 

236341_at 

236458_at 

236796_at 

236854_at 

236979_at 

23741 1_at 

238055_at 

238057_at 

238376_at 

23851 6_at 

238593_at 

238604 at 



0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 NIR3 


0.0001 


0.0001 


0.0001 


0.0001 HSPC195 


0.0001 


0.0001 STIM2 


0.0001 


0.0001 CTLA4 


0.0001 


0.0001 SEMA4B 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 K1AA0746 


0.0001 


0.0001 FREB 


0.0001 


0.0001 FLJ20668 


0.0001 


0.0001 FREB 


0.0001 


0.0001 FREB 


0.0001 


0.0001 FALZ 


0.0001 


0.0001 1 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 FCRH1 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 CTLA4 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 LOC15351 


U.UUU I 


n nnm 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 


0.0001 BMPR2 


0.0001 


0.0001 


0.0001 


0.0001 
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238790_at 


0.0001 


0.0001 


1 


2i9054_at 


0.0001 


0.0001 




239122_at 


0.0001 


0.0001 


IL24 


239152_at 


0.0001 


0.0001 




239229_at 


0.0001 


0.0001 




239231_at 


0.0001 


0.0001 




239278_at 


0.0001 


0.0001 




239287_at 


0.0001 


0.0001 




239292_at 


0.0001 


0.0001 




239442_at 


0.0001 


0.0001 




239679_at 


0.0001 


0.0001 




240061_at 


0.0001 


0.0001 




240572_s_at 


0.0001 


0.0001 




241483_at 


0.0001 


0.0001 




242104_at 


0.0001 


0.0001 




242388_x_at 


0.0001 


0.0001 




242404_at 


0.0001 


0.0001 




242458_at 


0.0001 


0.0001 




242463_x_at 


0.0001 


0.0001 




242541_at 


0.0001 


0.0001 


ABCA9 


242633_x_at 


0.0001 


0.0001 




242774_at 


+ 0.0001 


0.0001 


SYNE-2 


242866_x_at 


0.0001 


0.0001 




242932_at 


0.0001 


0.0001 




243030_at 


0.0001 


0.0001 




243154_at 


0.0001 


0.0001 




243780_at 


+ 0.0001 


0.0001 




243798_at 


0.0001 


0.0001 




243932_at 


0.0001 


0.0001 




243968_x_at 


0.0001 


0.0001 




244261_at 


0.0001 


0.0001 




32541_at 


0.0001 


0.0001 


PPP3CC 


35974_at 


0.0001 


0.0001 


LRMP 


37152_at 


0.0001 


0.0001 


PPARD 


37831_at 


0.0001 


0.0001 


KIAA0545 


38149_at 


0.0001 


0.0001 


KIAA0053 


41220_at 


+ 0.0001 


0.0001 


MSF 


41660_at 


0.0001 


0.0001 


CELSR1 


44790_s_at 


+ 0.0001 


0.0001 


FLJ21562 


49485__at 


0.0001 


0.0001 


PRDM4 


54970_at 


0.0001 


0.0001 


DKFZP761 12123 


74694_s_at 


0.0001 


0.0001 


FLJ23282 
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AFFX-HUMGAPDH/M33197_3_at-HG-U133A 0.0001 0.0001 GAPD 

AFFX-HUMGAPDH/M33197_3_at + 0.0001 0.0001 GAPD 

- HG-U133B 

AFFX-HUMGAPDH/M33197_M_at + 0.0001 0.0001 GAPD 

- HG-U133A 
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Table 43a: Functional annotation of all genes identified with relevance to leukemia subtyping 



Locus 



ID 


Gene Symbol 


Unigene_Cluster 


SWISS-PROT 


Link 


Map Location 


1007_s_at_HG-U133A 


DDR1 


Hs.75562 


Q08345 


780 


6p21.3 


1729_at_HG-U133A 


' TRADD 


Hs.89862 


Q 15628 


8717 


16q22 


200Q08_s_at_HG-U1 33A 


GDI2 


Hs.56845 


P50395 


2665 


10p15 


200008_s_at_HG-U1 33B 


GDI2 


Hs.56845 


P50395 


2665 


10p15 


200023_s_at_HG-U1 33A 


EIF3S5 


Hs.781 1 


000303 


8665 


2p16.1 


200040_at_HG-U133A 


KHDRBS1 


Hs.1 19537 


Q99760 


10657 


1p32 


200047_s_at_HG-U1 33A 


YY1 


Hs.97496 


P25490 


7528 


14q 


200047_s_at_HG-U1 33B 


YY1 


Hs.97496 


P25490 


7528 


14q 


200056_s_at_HG-U1 33A 


C1D 


Hs.15164 


Q13901 


10438 


2p13-p12 


2G0068_s_atJHG-U1 33B 


CANX 


Hs.1 55560 


P27824 


821 


5q35 


200071_at_HG-U133A 


SPF30 , 


Hs.79968 


075940 


10285 


10q23 


200072_s_at_HG-U 1 33A 


HNRPM 


Hs.79024 


P52272 


4670 


19p13.3-p13.2 


200093_s_at_HG-U1 33A 


HINT1 


Hs.256697 


P49773 


3094 


5q31.2 


200094_s_at_HG-U1 33A 


EEF2 


Hs.75309 


P13639 


1938 


19pter-q12 


200598_s_at_HG-U1 33A 


TRA1 


Hs.82689 


P14625 


7184 


12q24.2-q24.3 


200608_s_at_HG-U1 33A 


RAD21 ' 


Hs.81848 


O60216 


5885 


8q24 


200620_at_HG-U133A 


C1orf8 


Hs.11441 


Q9BXS4 


9528 


1p36-p31 


200625_s_at_HG-U1 33A 


CAP 


Hs. 1041 25 


Q01518 


10487 


1p34.3 


200630_x_at_HG-U1 33A 


SET 


Hs. 145279 


Q15541 


6418 


9q34 


200631 _s_at_HG-U133A 


SET 


Hs. 145279 


Q01105 


6418 


9q34 


200646_s_at_H G-U 1 33A 


NUCB1 


Hs. 172609 


Q96BA4 


4924 


19q13.2-q13.4 


200649_at_HG-U 1 33A 


NUCB1 


Hs. 172609 


Q96BA4 


4924 


19q13.2-q13.4 


200650_s_at_HG-U1 33A 


LDHA 


Hs.2795 


P00338 


3939 


11p15.4 


200654_at_HG-U 1 33A 


P4HB 


Hs.75655 


P07237 


5034 


17q25 


200655_s_at_HG-U 1 33A 


CALM1 


Hs. 177656 


AAH08597 


801 


14q24-q31 


200656_s_atJHG-U1 33A 


P4HB 


Hs.75655 


AAH 10859 


5034 


17q25 


200659_s_at_HG-U1 33A 


PHB 


Hs.75323 


P35232 


5245 


17q21 


200661_atJHG-U133A 


PPGB 


Hs. 11 81 26 


P10619 


5476 


20q13.1 


200665_s_at_HG-U1 33A 


SPARC 


Hs.1 11779 


AAH0801 1 


6678 


5q31.3-q32 


200673_atJHG-U 1 33A 


LAPTM4A 


Hs.1 11894 


Q15012 


9741 


2p24.3 


200675_at_HG-U1 33A 


CD81 


Hs.54457 


P18582 


975 


11p15.5 


200679_x__at_HG-U1 33A 


HMG1 


Hs.337757 


P09429 


3146 


13q12 


200707_atJHG-U133A 


PRKCSH 


Hs.1 432 


Q96D06 


5589 


19p13.1-p13.2 


200742_s_at_HG-U1 33A 


CLN2 


Hs.20478 


O 14773 


1200 


11p15 


200743_s_at_HG-U1 33A 


CLN2 


Hs.20478 


014773 


1200 


11p15 


200765_x_at_HG-U1 33A 


CTNNA1 


Hs.1 78452 


AAH00385 


1495 


5q31 


200770_s_at_HG-U1 33A 


LAMC1 


Hs.214982 


P11047 


3915 


1q31 


200825_s_at_HG-U1 33A 


ORP150 


Hs.277704 


Q9Y4L1 


10525 


11q23.1-q23.3 
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20G829_x_at_HG-U1 33A 


2NF207 


Hs.62112 


043670 


7756 


17q11.2 


200832_s_at_HG-U1 33A 


SCD 


Hs. 11 9597 


000767 


6319 


10q23-q24 


200867_at_HG-U1 33A 












200872_at_HG-U133A 


S100A10 


Hs. 11 9301 


P08206 


6281 


1q21 


200890_s_atJHG-U1 33A 


SSR1 


Hs.250773 


P43307 


6745 


6p23 


200906_s_at_HG-U1 33A 


i 










200907_s_at_HG-U1 33A 


KIAA0992 


Hs. 194431 


Q8WX93 


23022 


4q32.3 ♦ 


20091 7_s_at_HG-U1 33A 


SRPR 


Hs.75730 


P08240 


6734 


11q23-q24 


200931 _s_at_HG-U1 33A 


VCL 


Hs.75350 


P18206 


7414 


10q22.1-q23 


200935_at_HG-U133A 


CALR 


Hs. 16488 


P27797 


811 


19p13.3-p13.2 


200951_s_at_HG-U1 33A 


CCND2 


Hs.75586 


P30279 


894 


12p13 


200952_s_aLHG-U1 33A 


CCND2 


Hs.75586 


P30279 


894 


12p13 


200953_s_at_HG-U1 33A 


CCND2 


Hs.75586 


P30279 


894 


12p13 


200967_at_HG-U1 33A 


PPIB 


Hs.699 


Q9BVK5 


5479 


15q21-q22 


200968_s_at_HG-U1 33A 


PPIB 


Hs.699 


P23284 


5479 


15q21-q22 


200984_s_at_HG-U1 33A 


CD59 


Hs.278573 


P13987 


966 


11p13 


200985_s_at_HG-U1 33A 


CD59 


Hs.278573 


AAH01506 




11p13 


200986_at_HG-U1 33A 


SERPING1 


Hs. 151 242 


P05155 


710 


11q12-q13.1 


200996_at_HG-U1 33A 


ACTR3 


Hs.5321 


AAD51904 


10096 


2q13 


200998_s_at_HG-U1 33A 


GKAP4 


Hs.74368 


Q07065 


10970 


12q23.3 


201004_at_HG-U133A 


SSR4 


Hs. 1021 35 


Q96BQ0 


6748 


Xq28 


201005_at_HG-U133A ' 


CD9 


Hs.1244 


P21926 


928 


12p13.3 


201 008_s_at_HG-U1 33A 


TXNIP 


Hs. 179526 


Q 16226 


10628 


1q12 


201012_at_HG-U133A 


ANXA1 


Hs.78225 


P04083 


301 


9q12-q21.2 


201 01 5_s_atHG-U1 33A 


JUP 


Hs.2340 


Q15151 


3728 


17q21 


201 029_s_at_HG-U1 33A 


MIC2 


Hs. 177543 


AAH02584 


4267 


Xp22.32 


201034_atJHG-U133A 


ADD3 


Hs.324470 


Q9UEY8 


120 


10q24.2-q24.3 


201 061_s_at_HG-U1 33A 


EPB72 


Hs. 160483 


P27105 


2040 


9q34.1 


201 069_at JHG-U1 33A 


MMP2 


Hs.111301 


P08253 


4313 


16q13-q21 


201 079_at_HG-U133A 


SYNGR2 


Hs.5097 


043760 


9144 


17qter 


201 105_atJHG-U133A 


LGALS1 


Hs.227751 


P09382 


3956 


22q13.1 


201 1 25_s_at_HG-U 1 33A 


ITGB5 


Hs. 149846 


AAH06541 


3693 


3q21.3 


201 137_s_at_HG-U133A 


HLA-DPB1 


Hs.814 


Q30174 


3115 


6p21.3 


201 1 52_s_at_HG-U1 33A 


MBNL 


Hs.28578 


Q96P92 


4154 


3q25 


201 1 53_s_at_HG-U1 33A 


MBNL 


Hs.28578 


Q96RE3 


4154 


3q25 


201 1 62_at_HG-U133A 


IGFBP7 


Hs. 11 9206 


Q16270 


3490 


4q12 


201 163_s_at_HG-U133A 


IGFBP7 


Hs. 11 9206 


Q07822 


3490 


4q12 


201 1 64_s_at_HG-U1 33A 


PUM1 


Hs. 153834 


Q14671 


9698 


1p35.2 


201 1 89_s_at_HG-U1 33A 


ITPR3 


Hs.77515 


Q14573 


3710 


6p21 


201 200_at_HG-U133A 


CREG 


Hs.5710 


075629 


8804 


1q24 


201247_at_HG-U133A 


SREBF2 


Hs. 108689 


Q12772 


6721 


22q13 


201263_jatJHG-U133A 


TARS 


Hs.84131 


P26639 


6897 


5p13-cen 
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201278_at_HG-U133A 


DAB2 


Hs.81988 


P98082 


,1601 


5p13 


2<Jl 281_atJ-IG-U 133A 


ADRM1 


Hs.90107 


Q16186 


11047 


20q13.33 


201324_at_HG-U133A 


EMP1 


Hs.79368 


P54849 


2012 


12p12.3 


201 334_s_at_HG-U1 33A 


ARHGEF12 


Hs.6582 


015086 


23365 


11q23.3 


201 338_x_at_HG-U1 33A 


GTF3A 


Hs.75113 


Q9H2K0 


2971 


13q12.3-q13.1 


201352_at_HG-U133A 


' YME1L1 


Hs.206521 


Q9UMR9 


10730 


10p14 


201 358_s_at_HG-U1 33A 


COPB 


Hs.3059 


AAL39009 


1315 


11p15.2 


201 359_at_HG-U1 33A 


COPB 


Hs.3059 


P53618 


1315 


11p15.2 


201 360_at_HG-U1 33A 


CST3 


Hs. 135084 


P01034 


1471 


20p11.21 


201 362_at_HG-U133A 


NS1-BP 


Hs. 197298 


Q9Y480 


10625 


1q25.1-q31.1 


201377_at_HG-U133A 


KIAA0144 


Hs.8127 


Q9BTU3 


9898 


1q21.2 


201 396_s_jat_HG-U1 33A 


SGT 


Hs.203910 


043765 


6449 


19p13 


201416_at_HG-U133A 


SOX4 


Hs. 83484 


Q99687 


6659 


6p22.3 


201417_at_HG-U133A 












201425_jat_HG-U133A 


ALDH2 ' 


Hs. 195432 


P05091 


217 


12q24.2 


201 433_s_at_HG-U1 33A 


PTDSS1 


Hs.77329 


Q9BUQ5 


9791 


8q22 


201 437_s_at_HG-U1 33A 


EIF4E 


Hs.79306 


P06730 


1977 


4q21-q25 


201 462_at_HG-U 1 33A 


KIAA0193 


Hs.75137 


BAA12106 


9805 


7p14.3-p14.1 


201 463_s_at_HG-U1 33A 


TALDOI 


Hs.77290 


P37837 


6888 


11p15.5-p15.4 


201 482_at JHG-U1 33A 


QSCN6 


Hs.77266 


000391 


5768 


1q24 


201 485_s_at_HG-U1 33A 


RCN2 


Hs.79088 


AAH04892 


5955 


15q23 


201 486_at_HG-U1 33A 


RCN2 


Hs.79088 


AAH04892 


5955 


15q23 


201496_x_at_HG-U1 33A 


MYH11 


Hs.78344 


014729 


4629 


16p13.13-p13.12 


201 497_x_at_HG-U1 33A 


MYH11 


Hs.78344 


01 4729 


4629 


16p13.13-p13.12 


201 506_at_HG-U1 33A 


TGFBI 


Hs. 11 8787 


Q96IU7 


7045 


5q31 


201 530_x_at_HG-U 1 33A 


EIF4A1 


Hs. 129673 


Q9BRB1 


1973 


17p13 


201537_s_at_HG-U1 33A 


DUSP3 


Hs.181046 


Q96HQ4 


1845 


17q21 


201 540_at_HG-U1 33A 


FHL1 


Hs.239069 


Q13642 


2273 


Xq26 


201 548_s_at_HG-U1 33A 


PLU-1 


Hs. 143323 


09581 1 


10765 


1q32.1 


201 564_s_at_HG-U1 33A 


SNL 


Hs. 11 8400 


Q96IH1 


6624 


7p22 


201 585_s_at_HG-U1 33A 


SFPQ 


Hs. 180610 


P23246 


6421 


1p34.2 • 


201592__at_HG-U133A 


EIF3S3 


Hs.58189 


015372 


8667 


8q24.11 


201 595_s_at_HG-U1 33A 


HT010 


Hs.6375 


Q9NZ37 


55854 


2q32.2 


201 596_x_at_HG-U1 33A 


KRT18 


Hs.65114 


Q96GD2 


3875 


12q13 


201 602_s_at_HG-U1 33A 


PPP1R12A 


Hs. 16533 


Q96L23 


4659 


12q15-q21 


201 640_x_at_HG-U1 33A 


CLPTM1 


Hs. 106671 


Q9BSS5 


1209 


19q13.2-q13.3 


201644_atJHG-U133A 


TSTA3 


Hs.264428 


Q13630 


7264 


8q24.3 


201652_at_HG-U133A 


COPS5 


Hs. 198767 


Q9BQ17 


10987 


8q12.3 


201 753_s_at_HG-U1 33A 


ADD3 


Hs.324470 


Q9UEY8 


120 


10q24.2-q24.3 


201761_at_HG-U133A 


MTHFD2 


Hs. 154672 


P13995 


10797 


2p12 


201 81 0_s_at_HG-U1 33A 


SH3BP5 


Hs. 1091 50 


O60239 


9467 


3p24.3 


201 81 1_x_atJHG-U1 33A 


SH3BP5 


Hs. 1091 50 


Q9BRM4 


9467 


3p24.3 
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201 81 2__s_at_HG-U 1 33A 


LOC54543 


201813 s at HG-U133A 


KIAA0210 


901825 s at HG-U133A 


LOC51097 


201828 x at HG-U133A 


CXX1 


201829 at_HG-U133A 


NET1 


201830 s at HG-U133A 

i www w ai i ■ w • ww» » 


NET1 


201851 at HG-U133A 


SH3GL1 


201853 s at HG-U133A 


CDC25B 


201858 s at HG-U133A 
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228834__at_HG-U133B 
228904_at_HG-U1 33B 
22891 6_at_HG-U1 33B 
228984_atJHG-U133B 
228988_at_HG-U133B 
229001 _at_HG-U133B 
229003__x_at_HG-U1 33B 
229061_s_at_HO-U133B 
229070_at_HG-U133B 
229072_atJHG-U133B 
229138_at_HG-U133B 
229168_at_HG-U133B 
229194_at_HG-U133B 
229232_at_HG-U 1 33B 
229253_at_HG-U133B 
229280_s_at_HG-U1 33B 
229302_aLHG-U133B 
22931 0_aL.HG-U133B 
229344_x_atJHG-U1 33B 
229362_at_HG-U133B 
229368_s_at_HG-U1 33B 
229383_at_HG-U133B 
229420_at_HG-U 1 33B 
229487_at__HG-U1 33B 
22951 3_at_HG-U133B 
229575_atJHG-U133B 
229621_x_at_HG-U1 33B 
229637_atJHG-U133B 
229659_s_at_HG-U1 33B 
229681_at_HG-U133B 
22971 1_s_at_HG-U133B 
229722_at_H G-U 1 33B 
229745_x_at_HG-U1 33B 
229750_at JHG-U1 33B 
229779_at_HG-U1 33B 
229790_atJHG-U1 33B 
229817 at HG-U133B 



C20orf100 



TOB1 



KIAA1394 
ZNF6 



SLC25A13 
MGC12335 



CTMP 

KIAA1921 
KIAA1238 

ZNF216 

RPL23A 



MGC5370 
HSPC072 



TERF2 

DKFZP434M098 



Hs.55098 

Hs.25318 

Hs.26608 

Hs.325823 

Hs.90858 

Hs.178137 

Hs. 156044 

Hs.24380 

Hs.32156 

Hs. 326801 

Hs.39911 

Hs.351871 

Hs.3599 

Hs.97411 

Hs.26339 

Hs.59698 

Hs. 55407 

Hs. 126695 

Hs. 194071 

Hs.293691 

Hs.40808 
Hs. 348392 
Hs.236463 
Hs. 374508 
Hs.3776 
Hs. 107369 
Hs.350046 
Hs. 120785 

Hs. 122642 

Hs.348805 

Hs.25768 

Hs.205126 

Hs.17551 

Hs.332938 

Hs.87329 

Hs.134185 

Hs. 143087 

Hs. 119471 

Hs. 100030 

Hs.93738 



Q96NM4 



P50616 

Q96MI1 
Q9P2F3 
Q9Y4J6 
Q9H7J1 

Q9UJS0 
Q96IZ2 



Q9NT93 



Q96AB5 



Q96PW7 
Q9ULI2 

076080 



Q96J14 
Q96B18 



AAH24890 
Q9UFL4 



84969 20q13.11 



10140 17q21 



57571 11q12.2 
7552 Xq13-q21.1 



10165 7q21.3 
84830 6p22.3 



117145 1q21 



114818 2p23.3 
57494 12p13.32 

7763 9q13-q21 



AAH14459 6147 17q11 



84825 12q13.2 
29075 20p11.23 



7014 16q22.1 
25863 
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229833_at_HG-U133B 




Hs.33728 








229844_at_HG-U 1 33B 




Hs.59368 








229848_atJHG-U133B 


2NF10 


Hs. 1041 15 


AAH24182 


7556 


12q24.33 


229872_s_at_HG-U1 33B 


KIAA0493 


Hs.36475 




57234 


1q21.3 


229923_atJHG-U1 33B 












229934_at_H G-U 1 33B 


i 


Hs.38218 








229937_x_atJHG-U1 33B 




Hs. 149924 






• 


230110_at_HG-U133B 




Hs.210792 








230245_s_at_HG-U1 33B 




Hs. 181 297 








230292_at_HG-U1 33B 




Hs.25447 


Q96DJ9 






230363_s_at_HG-U1 33B 


SAC2 


Hs.52463 


Q9Y2H2 


22876 


10q26.13 


230405_at_HG-U1 33B 


RAD50 


Hs.41587 


043254 


10111 


5q31 


230441_at_HG-U133B 




Hs.374272 








230489_atJHG-U1 33B 


CDS 


Hs.58685 


AAH27901 


921 


11q13 


230499_at_HG-U 1 33B 




Hs. 121 572 








230526_at_HG-U133B 


FLJ20015 


Hs.80618 


Q96MU9 


54459 


17q25 


230551_atJHG-U133B 




Hs. 107331 








230588_s_at_HG-U1 33B 


MCPR 


Hs.40137 


Q9H8D0 


64682 


2q12.1 


230636_s_at_HG-U1 33B 


BTEB1 


Hs. 150557 


Q 13886 


687 


9q13 


230648_at_HG-U133B 




Hs. 181 297 








230650_at_HG-U1 33B 




Hs. 152460 








230659_at_HG-U1 33B ■ 


KIAA0212 


Hs. 154332 


Q9261 1 


9695 


3p26.1 


230689_at_HG-U133B 




Hs. 163426 








23071 3_at_HG-U133B 




Hs. 1821 85 








230743_at_HG-U1 33B 




Hs. 183096 


Q96FJ8 






230753_at_HG-U133B 




Hs.11594 








230768_at_HG-U133B 




Hs.58753 








230775_s_at_HG-U1 33B 


KIAA0610 


Hs. 11 8087 


060349 


23111 


13q13.1 


230793_at_HG-U 1 33B 


FLJ20048 


Hs. 11 6470 


Q9NXU7 


55604 


6p21.32 


230795_at_HG-U 1 33B 


H4F2 


Hs.55468 


AAH19846 


3022 


1q21 


230802_at JHG-U 1 33B 


DKFZP564B1162 


Hs.93589 


Q9H0T6 


83478 


4q21.23-q21.3 


230803_s_at_HG-U1 33B 


DKF2P564B1162 


Hs. 93589 


Q9H0T6 


83478 


4q21.23-q21.3 


230834__at_HG-U1 33B 




Hs. 11 451 6 








230877_atJHG-U1 33B 


IGHG3 


Hs. 300697 


AAH24289 


3502 


14q32.33 


23091 7_at_HG-U1 33B 




Hs.372303 








230939_at_HG-U1 33B 




Hs. 130352 








230960_atJHG-U1 33B 




Hs. 128292 








230983_at_HG-U1 33B 




Hs. 13731 9 








230986_at_HG-U1 33B 




Hs. 141 120 








230988_at_HG-U133B 




Hs.15155 








230997_at_HG-U133B 




Hs.131816 








231 0Q2_s_at_HG-U1 33B 


NUP88 


Hs. 1721 08 


Q99567 


4927 


17p13 
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OQincn ^* i_i i mood 
Zol U0U__at_nCj-U1 33 B 


HRLP5 


00*1 ocd _ a i mood 

231 Ub9__at_nG-U 1 33 B 




231 Uy 3_at_n G-U 1 3 3 B 


FCRH3 


OQ-i-in"! ur* t mood 
Zo\ lU1__at_no-U133B 


PPP2R5E 


004 H <tO _i II /"^ | MOOD 

231 \ 1 o_at_HC3-U1 33B 




231 1 81_at_HG-U1 33 B 




231 21 5_at_HG-U 1 33B 




231241_at_HG-U133B 




OO^iOCrt x l l i i j Ann 

231259_s_at_HG-U133B 


CCND2 


23 1 269_at_ HG-U 1 33 B 


DJ467N11.1 


231277_x_at_HG-U1 33 B 




231 31 0_at_HG-U1 33B 




231 332_at_HG-U1 33B 




OOM >1oo — x i i/-% i ■ .4 «-«.«-> r*t 

23 1 406_at_HG-U 1 33B 




231 41 8_at_HG-U 1 33B 


MS4A2 , 


231 472_at_HG-U 1 33 B 




OOHCOO — m ■ • /"\ t ijinnn 

231538_at_HG-U133B 


FLJ23499 


231 552_at_HG-U 1 33 B 




231 567_s_at_HG-U1 33B 


TSP-NY 


231 647_s___at_HG-U1 33B 


IRTA2 


231656_x__at_HG-U1 33B 


OSBPL10 


OO "70 0 — 1. II | i j 

231736_x_at_HG-U1 33B 


MGST1 


231775_at_HG-U133B 


TNFRSF10A 


231 793_s_at_HG-U1 33 B 


CAMK2D 


231 794_at_HG-U1 33B 


CTLA4 


231 837_at__HG-U133B 


USP28 


231 854_at_HG-U1 33B 




00*4 0~7/"l _ _x t 1 f> 1 Id ^"»<^r"» 

231 o70_s_at_HG-U1 33 B 


LOC51068 


Zolof o__ai — nb-U i ood 




231 887_s_at_HG-U1 33B 


KIAA1274 


231 897_at_H G-U 1 33 B 




231 92o_at_HG-U1 33 B 




OO -1 H/l C II /"N • MOOD 

^ol y4o_at_no-U1 33B 


KIAA1275 


oo a ncn _ x ma ■ u Aon 

231 yby_at_HG-U1 33B 


DKFZp762K2 


OQ -1 QQH ^.i. UP 1 MOOD 

iyoi_at_HG-U133B 




231982_at_HG-U133B 




232027_at_HG-U1 33B 




232035__at_HG-U1 33B 


H4FH 


232060_at_HG-U 1 33B 




232080_at_HG-U 1 33B 


KIAA1301 


232112_at_HG-U133B 




232125_at_HG-U133B 
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Hs. 120869 


Q96KN8 


( 117245 


11q13.2 


Hs.295727 






Hs. 123296 


Q96LA4 


115352 


1q21-q22 


Hs. 173328 


Q 16537 


5529 


7p11.2-p12 


Hs. 133081 


Q96LS3 






Hs.88651 








Hs. 126962 








Hs. 155635 








Hs.75586 


P30279 


894 


12p13 


Hs. 14391 7 




63921 


6q16.1-q16.3 


Hs. 14822 








Hs. 11 31 70 








Hs.254986 








Hs. 129568 








Hs.89751 


AAH02807 


931 


11q12-q13.1 


Hs. 124087 








Hs. 17546 


Q9H5F2 


64776 


11q13-q22 


Hs.235240 








Hs. 97643 


Q9BYZ5 


84660 


12q24.31 


Hs.191958 


Q96RD9 


83416 


1q21 


Hs.321622 


Q9BTU5 


114884 


3p22.3 


Hs.355733 


P10620 


4257 


12p12.3-p12.1 


Hs.249190 


000220 


8797 


8p21 


Hs.1 11460 


Q13557 


817 


4q25 


Hs.247824 


Q96P43 


1493 


2q33 


Hs. 142856 


Q96RU2 


57646 


11q23 


Hs. 120905 








Hs. 181 022 


Q9Y226 


51068 




Hs.315284 








Hs.300646 


BAA86588 


27143 


10q22.3 


Hs. 102796 


Q96SK6 


27145 


6q14.2 


Hs. 159200 


Q9P2F5 


56977 


4q35.1 


Hs.93758 


AAH10926 


8365 


6p21.3 


Hs.8707 


Q9NPS9 


57520 


2q33.1 



Hs.220745 
Hs.202577 

1076 
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232127_at_HG-U133B 
232201_at_HG-U133B 
232204_at_HG-U1 33B 
23221 0_at_HG-U133B 
232231_at_HG-U133B 
232234_at_H G-U 1 33B 
232614_atJHG-U133B 
232636_atJHG-U133B 
23264 1 _at_H G-U 1 33 B 
232739_at_HG-U1 33B 
232841_at_HG-U133B 
232950_s_at_HG-U1 33B 
233072_at_HG-U 1 33B 
233106_at_HG-U133B 
233137_atJHG-U133B 
233138_at_HG-U133B 
2331 77_s_at_HG-U1 33B 
233 1 95_at_HG-U 1 33B 
233261 _at_HG-U 1 33B 
233328_x_at_HG-U1 33B 
233483_at_HG-U 1 33B 
233500_x_at_HG-U1 33B 
233520_s_at_HG-U1 33B 
233555_s_at_HG-U1 33B 
233559_s_at_HG-U1 33B 
233589_x_at_HG-U1 33B 
23361 3_x_at_HG-U1 33B 
23381 3_at_HG-U133B 
233845_at_HG-U1 33B 
233955_x_at_HG-U1 33B 
234005_x_at_HG-U1 33B 
234032_at_HG-U1 33B 
2341 07_s_atJHG-U1 33B 
2341 32_atJHG-U1 33B 
2341 40_s_at_HG-U1 33B 
234362_s_atJHG-U1 33B 
234584_s_at_HG-U1 33B 
234643_x_at_HG-U1 33B 
234660_s_at_HG-U1 33B 
234682_at_HG-U133B 
234725_s_at_HG-U1 33B 
234726 s at HG-U133B 



NKD2 
EBF 



C20orf24 



NIR3 

KIAA1857 



KIAA1184 



LLT1 



FENS-1 



HSPC195 
STK36 



STIM2 
CTLA4 
ATE1 

DIS3 

SEMA4B 



Hs.240951 

Hs.32425 

Hs.206868 

Hs. 184062 
Hs. 171 395 



Hs.272759 
Hs. 163642 
Hs.367825 

Hs. 178533 
Hs. 100747 

Hs.293916 

Hs. 193857 
Hs. 136748 



Hs.44743 

Hs.296737 

Hs.283921 
Hs. 15093 
Hs.26996 



Hs.278894 
Hs.247824 
Hs.355315 

Hs.323346 

Hs.9598 



Q969F2 
Q9UH73 



Q9UI05 



Q96IG7 



Q9H8N9 



Q9UI64 

AAH17439 

Q9H9N9 



Q9P246 
095653 
O95260 

Q8WWI2 

Q8WY86 



85409 5p15.3 
1879 5q34 



55969 20q11.22 



Q9BZ72 57605 12q24.31 

BAB47486 84628 9q34 



64583 2q36.1 



Q9H7M0 

Q8WUP7 29121 12p13 



57590 2q36.3 



51523 5q31.3 
27148 2q36.1 



57620 4p15.2 

1493 2q33 

11101 10q26.3 

22894 1 3q21.33 

10509 15q25 



1077 



BNSDOCID: <WO_ 



_03039443A2_I_ 



WO 03/039443 PCT/EP02/1 2303 



234764, 
234839. 
234862, 
235023, 
235051_ 
235052. 
235061_ 
235101. 
235124, 
235259, 
235263, 
235278. 
235287. 
235291, 
235331, 
235353, 
235359, 
235372. 
235385. 
235400. 
235401. 
235414. 
235421. 
235422. 
235428. 
235444. 
235446. 
235459. 
235483. 
235502. 
235521. 
235604. 
235647. 
235674. 
235692. 
235706. 
235719. 
235749. 
235753. 
235818. 
235823. 
235982 



xjat_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

s_at_HG-U133B 

x_at_HG-U133B 

at_HG-U133B 

aLHG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

s_at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

_at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

atJHG-U133B 

x_at_HG-U133B 

_at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

_at_HG-U133B 

3LHG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at_HG-U133B 

at HG-U133B 



KIAA1014 



DKFZP434A0131 



KIAA0746 

FREB 
FLJ20668 
FREB 
FREB 



FALZ 



HOXA3 



CPM 



FCRH1 



Hs.292896 

Hs.55098 

Hs. 50405 

Hs.291000 

Hs.6834 

Hs. 153400 

Hs.97415 

Hs. 15423 

Hs. 136585 

Hs. 180059 

Hs.268180 

Hs. 126695 

Hs.49500 

Hs. 1621 85 

Hs.266331 

Hs. 12920 

Hs.266331 

Hs.266331 

Hs. 1641 02 

Hs. 351 250 

Hs.99872 

Hs. 104480 

Hs.235860 

Hs.293702 

Hs. 13631 6 

Hs. 180711 

Hs.79204 

Hs.248074 

Hs.256801 

Hs.29341 1 

Hs. 13849 

Hs.31922 

Hs.334873 

Hs.99237 

Hs. 1961 69 
Hs. 133355 
Hs.49892 
Hs. 180644 



Q96NT4 
Q9H985 



23360 11q12.1 



AAH23002 

Q9NT73 54441 7q11.23-q21.1 

AAH27989 



094847 

Q96PJ4 
Q9NWR0 
AAM13980 
AAM13980 



Q12830 



23231 4p15.31 

84824 1q23.1 

55016 4q32.2 

84824 1q23.1 

84824 1q23.1 



2186 



17q24 



043365 



3200 



7p15-p14 



P14384 



1368 



12q15 



Q96LA6 



115350 1q21-q22 
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236019_atJHG-U133B 
2361 90_atJHG-U133B 
236226_at_HG-U1 33B 
236248_x_at_HG-U133B 
236265_at_HG-U 1 33B 
236280_at_HG-U 1 33B 
236293_at_HG-U 1 33B 
23630 1 _at_H G-U 1 33B 
236341 _at_HG-U 1 33B CTLA4 
236378_at_HG-U1 33B 
236458_at_HG-U 1 33B 
236535_atJHG-U133B FLJ221 16 

236557_atJHG-U1 33B 
236606_at_HG-U 1 33B 
236648_at_HG-U 1 33B 
236656_s_at_HG-U1 33B 
236707_at_HG-U133B DAPP1 
236728_at_HG-U1 33B 
236787_at_HG-U 1 33B 
236796_at_HG-U 1 33B 
236837_x_at_HG-U1 33B 
236854_atJHG-U1 33B , 
236892_s_at_HG-U1 33B HOXB6 
236908_at_HG-U133B 
23691 4_at_HG-U133B 
236979_at_HG-U133B 
237006_at_HG-U1 33B 
237068_at_HG-U 1 33B 
237291 _at_H G-U 1 33B 
237337_atJHG-U133B 
23741 1_at_HG-U1 33B LOC1 5351 6 

237431_atJHG-U133B 
237864_at_HG-U1 33B 
23801 2_at_HG-U133B 
238022_at_HG-U1 33B 
238026_at_HG-U1 33B RPL35A 
238039_at_HG-U133B 
238055_at_HG-U1 33B 
238057_at_HG-U 1 33B 
238066_at_HG-U1 33B RBP7 
238 1 55_at_HG-U 1 33B 
238208 at HG-U133B 



Hs. 19440 
Hs.1 15772 
Hs. 126808 
Hs. 167090 
Hs.88013 
Hs. 176920 
Hs.283313 
Hs.46645 
Hs.247824 
Hs.231898 
Hs.1 63426 
Hs. 34497 
Hs.1 86838 
Hs.47150 
Hs.1 44057 
Hs.265499 
Hs.62643 
Hs. 26330 
Hs.1 26630 
Hs.1 18659 
Hs.1 291 37 
Hs.48984 
Hs. 98428 
Hs.1 43905 
Hs.1 31 601 
Hs.1 68974 
Hs.24128 
Hs.200815 
Hs.1 59362 
Hs.1 15580 
Hs.38173 
Hs.97562 
Hs.1 96988 
Hs.37916 
Hs.237396 
Hs.287361 
Hs.27167 

Hs.252124 
Hs.292718 
Hs.1 71 689 
Hs.1 131 06 



095653 



1493 2q33 



Q9H6M0 79677 2p24.1 



CAD28547 27071 4q25-q27 



P17509 



3216 17q21-q22 



Q96FQ0 



153516 5q12.1 



P18077 



6165 3q29-qter 



Q96R05 



116362 1p36.22 
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238304_at_HG-U1 33B 




Hs.21868 


238365_s_at_HG-U 1 33B 




Hs. 158272 


238367_s_at_HG-U1 33B 




Hs. 158272 


238376_at_HG-U1 33B 




Hs. 169738 


238392_at_HG-U133B 




Hs. 145480 


23851 6_at_HG-U133B 


BMPR2 


Hs.53250 


238583_at_HG-U 1 33B 




Hs. 133878 


238587_atJHG-U1 33B 


MGC15437 


Hs.55067 


238593_at_HG-U133B 




Hs. 152003 


238604_at_HG-U1 33B 




Hs. 140489 


238651 _at_HG-U133B 




Hs.23096 


238652_at_HG-U133B 




Hs.313541 


238686_at_HG-U1 33B 


FBX03 


Hs. 16577 


238750_at_HG-U 1 33B 




Hs.218707 


238752_at_HG-U133B 


• 


Hs. 1221 55 


238756_at_HG-U1 33B 




Hs.41294 


238790_at_HG-U133B 




Hs. 1561 00 


238791_atJHG-U133B 




Hs.267124 


238824_at_HG-U1 33B 




Hs. 145569 


238856_s_at_HG-U1 33B 




Hs.72045 


239054_j3t_HG~U 1 33B 




Hs. 14461 6 


239071_atJHG-U133B 




Hs. 173904 


239122_at_HG-U133B 


IL24 


Hs.315463 


239152_at_HG-U133B 




Hs.243023 


23921 4_at JHG-U 1 33B 




Hs. 123244 


239229_at_HG-U 1 33B 




Hs.50125 


239231_atJHG-U133B 
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1007_s_at HG-U133A 
cacccagctggtcctgtggatgggatcctct^ 
5 racctgggccccactggacaacartgattcctggacaggtggctgcgcccccagcttctctctccctgtcaca^ 
ggaggacaagaaggagaggaaaatgtttccttgtgcctgctcctgtacttgtcctcagcttgggcttcttcctcctccat^ 
cgcccxagc(Xtcagtc^ccccccacttccc^cctgcagtcttgtagctagaacttctctaagcctatacg 



aatatatggacctagcttgaggcaat 

10 

1729_at HG-U133A 

ggaggcactcgaggagaacgagctcaccagcctggcagaggacttg 

ggagaacx:tggatggccttagggttccttctgcggctattgctgaacccctgtccatccacgggaccctgaaact 



15 ctcaccttctcccatcccagaagcggggcttacagcca^ 
cagtgaagta 

200008_s_at HG-U 1 33A 
gctcgct( 
20 caccaai 



200008_s_at HG-U133B 

25 



gggaagtacattgctatagttagtacaactgtggaaaccaaggagcrt^ 

catcagtgacctcctggtaccaaaagacttgggaacagaaagccagatctttatttcccgcacatatgatgccaccactcattttgagac 
30 200023_s_at HG-U133A 



agagtgattggactctraagtgacttgcagcaagtaggaggggca^^ 

aaggtgtcagctgacaatactgtgggccgcttcctgatgagc^tggttaaccaagtaccgaaaatagttcccgatgactttgagaccatgctcaa^^ 
gaccttttgatggtgacctacctggcc^acctcacacagtcacagattgcactcaatgaaaaacttgtaaacctgtgaatg^ 

35 ggtattaaccc 

200040_atHG-U133A 

tgactttttcattacgtgggttttgaaatctagccx:cagacatactgtgttgagagatacttagagggagggagtaggttttg 

gcctcctgaattgagtttgatgcagagctttttagccatgaagaat^^ 
40 ctgcactccatgaaaagtt(^cttggacgctggggccaaaagctgttgattttcttaagttgacggttgtcaatatatcg 



cggaagattaaatgctacatgtgtaagtctgcctaaat 
200047 s at HG-U133A 
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ggttttgmgctatcttaattttggttgtattctttgatgttaacacattttgtataattgtatcgtatagctgtattgaatcatgtagtatcaaat^ 
caatttaaacccattUagtcacttttttmccaaaaaaatart^ 
attgtaccttttaacacctgatgtgtacatcccatgta 

5 200047_s_at HG-U133B 

ggttttgmgctatcttaamtggttgtattcmgatgttaacacatmgtataattgtatcgtatagctgtattgaatcatgtagtatc^aatatta 

caatttaaacccattttagtcacttmtmccaaaaaaa 

attgtaccttttaacacctgatgtgtacatcccatgta 

10 200O56_s_atHG-U133A 

tctttattgattagcactgaggatmaacattgtgatatattatatatttataattt^^ 
cagctgtgaaatttaagtgaaatgttctttgtaaacatttgtactattte^ 

atatgatattaagaaagcgtgaaatgacttaaatgttcatttttttctgtatagatactttatcatgttttcatgattttaggaattactgctttgtt 
aagtttatggttgtactttaattcttggcatgttgcctctatgtcccatttaaaataaaatacattctcattaactttagatgg 

15 

200068_s_at HG-U 1 33A 

cctctcccacaccactatagtgtaataatgttattattactctacactgaaacgtattcagagttagatattattttagcttcagttgttctU 
atactgtttcttgcactgaatatataaacactccacagtgtttatattgggaagat^^^ 
tttgttc^cttaaattcttttgaggatgggatgtatttttcttgctgttcagtgctttttccttttcatctgttgttctgtggtcaca 
20 attgattacaaataaacagttgttacttagraagacctgaaaatatgtctgcaggttto^ 

cctccgttgacagtatatgtcatgcctcactttcttctagctgagctttaaatcattagagcttaaattgtcagatcgttcattgcctttc 

200068_s_at HG-U133B 

cctctcccacacractatagtgtaataatgttattattactctacart^ 
25 atactgtttcttgcactgaatatataaacactccacagtgtttatattgggaagatattgggaaggaaatatatttgtaaaagatgaaggctgtatctam 
tttgttcacttaaattcttttgaggatgggatgtatttttcttgctgtt(^gtgctttttcctmcatctgttgttctgtg 
attgattacaaataaacagttgttacttagcaagacctgaaaatatgtct^ 

cctccgttgacagtatatgtcatgcctcacmcttctagctgagctttaaatcattagagcttaaattgtcagatcgttcattgcctttc 

30 200071_atHG-U133A 

aaccttaggatgagctccctffictaaggtgtttttgtfflttg^ 
aacaaaaagctgatgtatagccctgtatacagtgtagatactatttttgtaaaaacacaagg 

tatttcttgtgcattaatctgacgataatttccctgtatattatgttcatttagctgtttgtaattmgttaattagatcaggttgtctgcattt 
tatcctgagttgagtttaagccaaatacatgcatagaaaagggtcttcctattaatgg 

35 

200072_s_at HG-U133A 

cattgagcgcatgggcctggccatgggtggcggtggcggtgccagctttgaccgtgccatcgagatggagcgtggraacttcggaggaagcttcg 
tggagctggaggcc^tgctcctggggtggccaggaaggcctgccagatatttgtgagaaatctgccattcgatttcacatggaagatgctaaagg 
agtgcggccacgtgctgtacgccgacatcaagatggagaatgggaagtccaaggggtgtggtg^ 
40 ggatgatgaatggcatgaagctgagtggccgagagattgac 

200093_s_at HG-U133A 

ctggtggcgacacgatcmgggaagatcatccgnnannanatacc^gcc^aaatcatttttgaggatgaccggtgccttgctttcc^ 
aacarattttctggtgatacccaagaaacatatatcccagaffictgtggragaaga 
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gatctgggcctgaataagggttatcgaatggtggtgaatgaaggttcagatggte^ 
cctcctggttaagcacgttttggggat 

200094_s_at HG-U133A 

5 cgctgacctgaggtccaacacgggcggccaggcgttcccccagtgtgtgtttgaccactggcagatcctgcccggagaccccttcgancaacanngn 
ccagcraggtnggtggcggagacccgranagcgcaagggcctgaaaga 

cgcctgccgccccggggactcgcagcacccnacagcaccacgtcctcgaattctcagacgacacctggagactgtcccgacacagcga^ 
ttctggggcccgctgcgtgccatc^ctcaaccanaacactta^ 

mgggtacatggcccgcagtggcttttgccccagaaaggggaaaggaacac^cgggtagat^^ 
1 0 cctccagctgtactagtgccattggaata 

200598_s__at HG-U133A 

cctgctgcatgtcacagacaccggtgtaggaatgaccagagaagagttggttaaaaacctt^ 
ctgaagcacaggaagatggccagtraacttctgaattgattggccagm 
1 5 caacgataccragcaratctgggagtctgactccaatgaattttctgta^ 
agaagaagcatctgattaccttgaattgg 

200608_s_at HG-U133A 

gtttggaacmaatagcgttgcaacgaaatcctatatccagmcctgtaatttaattgaagaaaaatacatccaaataaagactttattattaacagaccaga^ 
20 agaaatcatgtgactgttatgattatc^gaatatgtcttaactttttagggcaaagttaacactgaaagttctagcttaagtgttgaaactttt 
ttgaaactcagacttcagtgtatacccaataatttaaaattatgtgaaatgtffi 
aattgctattgcagcmatattcaatatgatgtgcctgtaaaccaaggagtfflcrc^ 
cttgttacacattttgttagtctggtttttgttgcttatcgggtttaatattgttctt 

25 200620_atHG-U133A 

cragtacgcaccaratttggagcaggagcctacaaatttgagagaatcatctctaagcaaaa 

tgaagatggagaaagtgatggctttttaagatgcctctctcttaactctgggtggamtaactacaactcttgtcctctcggt^ 

gctacagrtgtggagcagtatgttccrtctgagaagctgagtatctatggtga^ 

tagatctaaaactgaagatcatgaagaagcagggcrtctacctacaaaagtgaatcttgc^ 
30 ctaaaattccactcctcatagagctttta 



200625_s_at H G-U 1 33A 
aaaccracatgatttcaaggagtctggrattcctgaatccttctt^ 
gtctaagcacctgaacagaggactgaaacctccactgcaggctggttttagg^^ 
35 ctaggacaaacacttaccaaaatatgcaacttttttttggtgggaagagagattgtcctgtgatttctacccatttcc^^ 

200630_x_at HG-U133A 

ctacta^tggttctraatttatttggggggaaataccttgagcagaataraatgggaaaagagt^^^ 
40 aaaactgtatggaatraacaccaccgagctctgtgggaaaaaagaaaaacctgcto 

tagaattctagcttttttccta^ttctctgtatattgggctcagagagtacactgtgtrtctatgtgaatatgga^ 



200631_s_at HG-U133A 
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agacctggtgctctaatgccaagttatacacgggaragttgctggcatgtcttca^ 
gagcagtaactgtccctgctttctggtataaagctctcaaat 
accctaatgcataagctttttaatgctgtaaaatatagtcgctgaaatt 

aaaacttgataaatggaactattccatcaataggcaaaagtgtaacaacctatctagatggatagtatgtaatttctgcacaggtctctgtttag^ 
5 accgatcaggaatcttgctccaa 

200646_s_at HG-U 1 33A 

accctgcctacaccgaggaagagctgaggcgctttgaagaggagctggctgccc^^ 

gaggctctagggcggtcccagggccgcttggaggccaagaagagagagctgctgctggctgtgctgcacatggagcagcagaagcagcagcagcagcagcag 
1 0 caaggccacaaggccccggctgcccaccctgaggggcagctcaagttccaccx:agacacagacgatgtacctgtcccagctccagccggtgaccagaagg 
gtggacacttcagaaaagaaacttctcgagcggctccctgaggttgaggtgccccagcatctgtgatctcggaccccagccctcaggattc^ 
actgatgggcgctggatgaagtggcacagtcagcttcc 

200649_atHG-U133A 

1 5 tcccaagagggtctgctctgagcctgcgttcctaggtggctcggcctcagctgcctgggttgtggccgccctagcatcctgtatgccc^ 
ctgctccgggccaagcttctggttgattaatgagggcatggggtggtccctcaagaccttcccctaccttttgtggaaccagtgatgcctca 
cagctgggtgccaggggcaggggatcctcagtatagccggtgaaccctgatac^ 
tcctcctggacctcttggcccccagccccttcccra^ 
agcccctgaagaccagtcccacccacctttctggcctcato^ 

20 

200650_s_at HG-U133A 

gtcatatcatttcactgtctaggctacaac^ggattctaggtggaggttgtgcatgttgtccmttatctgatctgtgattaaagc^ 
acatcaactcctgaagttagaaataagaatggtttgtaaaatccacagctatatcctgatgctggatggtattaatcttgtgtagtctt^^ 
gccacctctgacgcaccactgccaatgctgtacgtactgcatttg^ 
25 gtccaacattttttcccagtgagtcacatcctgggatccagtgtataaatccaatatcatgtcttgtgcataattcttccaaa 

200654_atHG-U133A 

atcagggtatttgttccaccttggccaggcctcctcggagaagcitgtcccccgtgtgggagggacggagccggactggacatggtc^ 
cgccatgactgatcatggctcttgcattWgggtaaatggagacttccggatcctgtcagggtgtcccccatgcctggaagaggagctggt^ 
30 gcggcacagcctgggcctccccttccctraag 

aggactaccgaccagccggctatgggcacattacgtgaccactggcc 

200655_s_at HG-U133A 

acctattcaaatgggttctagttraamgfflagtataaattgtcatagctggfflart^ 
35 gtgaaggataaaccgttgagacgtggccccactggtaggatggtcctcttgtac^^ 
tgttggtttagcgttgtctgcattgttctagagtgaaacaggtgtcaggctgtra^ 
ttttaaaaccttctgaaccatgacttggagccggcagagtaggctgtgg^^ 
caagttgt 

40 200656_s atHG-U133A 
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tctgaagtgac^cctcacccctacac^ccgtccgttcacccccgtctcttccttctgctmcggtttttggaaaaccc^ 
gtttcctgaaaccatgatgtactttttcatacatgagtctgtccagagtgcttgctaccgtgttcggagtctc 



200659_s_at HG-U133A 

5 gtcactgatggaaggmgcggatgagggcatgtgcggctgaactgagaaggcaggcctc^^ 

gggaggggcagagaggaagtggtctgtctgttaccataagtctgattctctttaactgtgtgaccagcggaaacaggtgtgtgtgaactgggcac 

gcccctgttgaggtgggtgggcctgatfgttgccccccaggg 
agacttcctcttfacccccraccttggtcccto^ 

10 200661_at HG-U133A 

gagtggtttgtggattccctcaaccagaagatggaggtgcagcgccggccctggttagtgaagtacggggacagcggggagcagattgccggcttc^ 

tctcccacatc^cctttctcacgatcaagggcgccggcra 

ctgatgaccacagcaaccagctccacggcctgatgcagcccctcrc^ 

ccaggactgcccccttcccagagccctgtacatcccagactgggcccagggtctcccatagacagcctgggggcaagttagcactttatt^ 
1 5 atggggtggcxtggccxcttctctgcttaaagaatgccctttatgatgcactgattccatcccaggaacccaacagagctcagga 

200665_s_at HG-U133A 

gttggttcaaacttttgggagcacggactgtcagttctctgggaagtggtcagcgcatcctgcagggcttctcctcctctgtctttt^ 
ctctagggactgccaggctgmcagccaggaaggccaaaatcaagagtgagatgt^^ 
20 acatttggatgattgtcataaggtffitagcatgttcctcctm 

gctgagaactcgttcacctccaagcatttcatgaaaaagctgcttcttattaatcatacaaactctcaccatgatgtg 

200673_at HG-U133A 

atgcctgaaaaagaaccaccac^tccttacttacctgcctgaagaaattctgccmgacaataaatcctataccagctmtgtttgm 
25 gggctcttraaacttgtttgatataaaatatgttgtcttt^ 

taatagtttttgaagacaatctaggttaagcaagagcaaagtgcc^ttgtttgcctttaattggggggtgggaagggaaagagggtacttgcca^ta 
tgcactttctttatataatcgtttgcatmgttacttgctaccctg^ 

tctgcagaggcaagaaaaatatttgacattgtgacttgactgtggaagatgatggttgcatgtttctagtttgtatatgttte 

30 200675_at HG-U133A 

tgatcctgagcatggtgctgtgctgtggcatccggaacagctcx^ 

cacttccgagggggccatcaccgcctgtgtatataacgtttccggta^^ 

ggctgatgtcacatgtaggtggcgtgtatgagtggagacgggcctgg 

tcagccaggcctc^cctgggagcc^ctcgcccagagactcagcttggccaacttggggggctgtgtccacccagcccgcccgta 
35 gttccctcctgccccggttcgagagccgagtctgtgggcactctctgccttcatgcacctgtccmctaacacgtcgccttc^ 

200679_x_jat HG-U133A 

aagccgagaggcaaaatgtcatratatgcattttttgtgraaactt^ 

agtgctc^gagaggtggaagaccatgtctgctaaagagaaaggaaaamgaagatatggcaaaagcggacaaggcccgttatgaaagagaaatgaaaaccta 
40 tatccctccraaaggggagacaaaaaagaagttraaggatcccaatgra^ 
aaggagaacatcctggcctgtcc 

200707_atHG-U133A 
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ggcacctggggctcatggattggccccgaccacgacaagttcagtgc^ 
gcgcctcctgtgcgggaaagagaccatggtgaccagcacc^^ 

ccgcctgaagcacccaccgaagacgaccatgacgagctctagctggatgggcgcagagaacctcaagaa 

5 200742_s_atHG-U133A 

ttctcttttraatcaggcttttcraaagggttgtatac^ 
ctaccctgacatccagaaacaatggcctccagtgcatacttctcaat^ 

tctgactactcttgtcttcctctctcatcaatttctgcttcttcatggaatgctgaccttcattgctccatttgtagattttt^ 

ctgacatctac^accattaccatctcactaaataagac^tctatccaataatgattgatacctcaaatgtaagatgcgtgatactcaacatttcatcgt 
1 0 cccaaacaattccatctcgtttcttcttggtaa 

200743_s_at HG-U1 33A 
ccacccctgatacaactgccaggttaatttccagaattctt^^ 
gaaaaatcctagattttctaaacaaaacctgtttgaatcttggttc^ 
15 agtgtgaatatcaatacctgcctttcataatcaccagggaata^ 

cccttgcactgcgccccctgtgcctacctctagcattgtaacgaccacatagtattgaaatggccagtttacttgtctgccttcctttc^ 
tagaatcgtgtcctatttaactttgtgttcccaggtcctagctcaggagttggcaa 

200765_x_at HG-U133A 
20 aatgcaggagcctacttctagctgtatttUtgtatgctt^ 

attctcttagtgatggcggcgttaggtttgagagaaggaattggctcaacttcagttgagagggtgcagtccagacagcttgactgcttttaaatgacc 

gtggtaagcaacctggcatcttaggaagc^gtccttgagaaggc^tgttccagaaaggtctctgaggac^aactcactcagtaaaacataatgtatcatgaa 

actgattctctatgaratgaaatgaaaattttaatgrattgtt 

atcagaacacactttttttcctctttctcccagcttcaaatgcacaattcatcattgggctcacttctaa 

25 

200770_s_at HG-U133A 

gcagggatggcttcacaggctgctcaagaagccgagatcaatgccagaaaa 

ggggcagctggatacagtggacctgaataagctaaacgagattgaaggcaccctaaacaaagcc^aagatgaaatgaaggtcagcgatcttgataggaaagtgt 
ctgacctggagaatgaagccaagaagcaggaggctgccatcatggactataaccgagatatcgaggagatcatgaaggacattcgcaatctggaggac^tcag 
30 aagaccttaccatctggctgcttcaacaccccgtccattgaaaagccctagtgtctttagggctggaaggcagc^tccctctgacag 
gagtgccttgacacaaagattacatttttcagacccccactcctctgctgctgtccatcactgtccttttgaaccagga 

200825_s_at HG-U 1 33A 
ctgggctgaccaaaatgtgctttctactgtgagtccctato 
35 ggcctgtggatgaaggaggagggtcagggctggccttcctctgtgc^tcactctgctaggttgggggcccccgacccaccata^ 
cagtattccgtctgtagcaggagctagggctgctgcctcagctcca^ 

gatgggacctaagggtcccacccctcaccccacccccacctcttctgtatgtttgaattcmcagtagctgttgatgctggttgga^^ 
cattgttaattgag 

40 200829_x_at HG-U133A 

attatcttcccacataccaggaactattggacatttattttac^tgggaaaaattatttggaataataaagcaggaacttttcctgaa 

tcatgfficatctaggttmagaagtgaagtatagtaaaffiggttcgttaaattgtga^^ 

tacattgctatttgtaaagtttgaMtcacagcamragatgctg^ 
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ctgaatgtgtaaraatamaatgtatttagagttcctcatgttgcag 

tgagagaaaaatttatgccttmattcataacccagctgtggacxactgcctgaaaggtttgtacagatgcatgccacagtagatgtcc^ 

200832_s_at HG-U133A 
5 aaaagcgaggtggcc^tgttatgctggtggttgccagggcctccaacc^ctgtgc^ 
ttccttctgttaaaatggggataataatactgacctacctcaaagggcagttt^ 
agacctgagtattttttataataggaatgtcraccatgaacttgatacgtccg^ 

agaagcggtggataactagccagacaaaatttgagaatacataaacaacgrattgccacggaaacatacagaggatgrc^ 
ttttatgtgggatatagtagttacttgtgacaagaataattttggaataatttctattaatatcaactctgaagctaattgtactaatctgagat 

10 

200867_at HG-U133A 

ttgcmaggaacagctcaagaaccttggagttratatttracaaatattaataaatata^ 

aaaggratatgggtttaggttggmtttgatgtcatatgtctctgatggggctgca^ 

agctgcagatgccgagaagccagcaaacacagggcccactggaa 

15 

200872_atHG-U133A 

gcttcaacggaccacacc^aaatgccatctcaaatggaacacgccatggaaac^atgatgtUacatUcacaaattc^ctggggataaaggcta 
ggacctgagagtactcatggaaaaggagttccctggatttttggaaaatcaaaaagaccctctggctgtggacaaaataatgaaggacctggaccagtgtagagat 
ggcaaagtgggcttccagagcttcttttccctaattgcgggcrt^ 
20 atgagcagttcgctcctccctgataagagttgtccaaagggtcgcttaaggaatctgccccacagcttcccccatagaaggatttcatgagcagatcagg 
caaat 

200890_s_at HG-U133A 

gttggtcttaccagaatcattgttgaactacctgaaacaagtctttagaatactaaatataatgctgttgtctcttcctttttgacattttctgatttt 
25 atatttacccactatgattattgatgtcctgccttgaacagttttaaagaaaacaatttttggaatagctcaaatttcaattgatggcacaaatca^ 
attacaattagtattctaaaggcagaagcagaagtagctgctttttagcaatagaattgtttragt^^^ 
aaggaatttggggatctctctggcaacaaattgtgaaacatgaaatttctgctgactttaatatatgaaacctaatcctaccc 

200906_s_at HG-U133A 
30 gttgctgatgggtacccagtgcggctggaatgtcgtgtattgggagtgccacca^ 

gtgagcatgcaccaggac^accacggctacatctgcctgctcattcagggagcc^caaaagaagatgctgggtggtatactgtgtcagccaagaatg 
gattgtgtcctgtactgccaggctggac^tttacaccc^gtggcatcag 
ctttcggaccagggactagacatcaaagcagcgttccaacctgaggccaacccatrt^ 
cattcttgttaaagctgaaacactgaaacagccattgccttgaccaacatattcc 

35 

200907_s_at HG-U133A 

aaacactgccsttcacaagtraaggaacccagggccagctggaagtgtggagc^ 
gtagtagtggaagattttaggtatgtagagcaagttgaaaatggattgagactgcatggtggrataa^^^ 
gagacataaggagagacaaaaacaggtttgtgccataaagtattttttcaaagacaccaagatg^ 
40 tgccttaagaaggtgctggattccaaggtttgtaaaggcatctc^gta^ 

20091 7_s_at HG-U133A 
ttttctcraagggcgggcttgtgctctggtgcttcragggc^ 

aacaactccttcacccatgaggcactcacactcaagtataaactggac^accsgtttgagctggtgtttgtggttggttttcagaaga 
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aaattgatagatgacgtgratcggctgtttcgggacaagtacc^ 

ctgcggctccttcgtgaagcagaggagagcagtaagatccgtgctcccactaccatgaagaaatttgaagattctgaaaaggccaagaaacctgtgaggtccatga 
ttgagacacggggg 

5 200931_s_at HG-U133A 

tgcaaccaactacaaccaagctctctgcatctactcccaagtatggggttcaagagagtaatgggtttcatatttcttatca 

gagggcagtgtttcctttttggtacttattactgctaagtattto 

aacatccttccttcccacccatcgctgtgtgttagttcccaacatcgaatgtgtacaacttaagttggtcctttacactcaggcttte^ 

ttttcaaggccctcagcatatttgtatagttgcttgcctgate^^ 
10 acttagaggaaataagaaaaatcatgmgctctcccggttcttccagtggtttgagacactggtttacactttatgccggatgtgctW 

200935_atHG-U133A 

ggccaggccaaggacgagctgtagagaggcctgcctccagggctggactgaggcctgagcgctcctgccgcagagcttgccgcgccaaataatgtctctgtgaga 
ctcgagaactttcatttttttccaggctggttcggatttggggtggattt 

15 

200951_S_at HG-U133A 

aagcatgctcagaccttcattgctctgtgtgccacngac^ttaagtttgccatgtacccaccgtcgatgatcgcaactggaagtgtgggagcagccat^ 
gcaggatgaggaagtgagctcgctcacttgtgatgccctgactgagctgctggctaagatcaccaacacagacgtggattgtctcaaagcttgccaggagcaga 
aggcggtgctcctcaatagcctgcagcagtaccgtcaggaccaacgtgacggatccaagtcggaggatgaactggaccaagccagcacccctacagacgtgcg 
20 ggatatcgacctgtgaggatgccagttgggccgaaagagagagacgcgtccataatctggtctcttcttctttctggttgmttgttctttgtgtttt^ 
aaaaaattctgcccccacctagatcatatttaaa 

200952_s_jat HG-U133A 
gcaagtcatgctgaatactcctcccctcttctrt^ 
25 cclaccangactcaagagacancccttcctgctgacattcccatcacaacattcctcaga^ 

cttaattcccatctctatacttctcctttggacatggaaagaaaagttattgctggtgcaaagatagatggctgaacatcagggtgt^ 
ttattgttgttgttaattttattgcaaagttgtattcagcgtacttgaatttttcttcctctccacttcttagaggcat 

200953_s_at HG-U133A 
30 gcc^ttacagtatccaatgtcttttgacaggtgcctgtccttgaaaaacaaagtttctatttttatttttaattggmagttcttaa 

tcatcgggccattggattmtccattatgttratcacc^^ 
ttmtttttgctttaaaacaagtgtgatgccatatcaagtcc^ 

gtagcatgatcagtgtatgcgaaaaggtmtaggaagtatggcaaaaatgttgtattggctatgatggtgacatgatatagtcagctgccttttaagagg^ 

gtgtt 

35 

200967_at HG-U133A 

gcaaattccatcgtgtaatcaaggadtcatgatcragggcgga 

agaacttcaaactgaagcactacgggcctggctgggtgagcatggccaacgcaggc^aagacaccaacggctcccagttcttcatcacgacagtc^ 
tggctagatggcaagcatgtggtgtttggc^aagttctagagggcatggaggtggtgcggaaggtggagagcaccaagac^gacagccgggataaa 
40 ggatgtgatcatcgcagactgcggraagatcgaggtggagaagcc^ 
ccctgtagtccgccacagggctctgagctgcactggccrc^ 
acaaactcctaccaacactgac 

200968_s_at HG-U133A 
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gacttcatgatccagggcggagacttraccaggggagatggracaggaggaaagag 
cgggcctggctgggtgagcatggcc^acgcaggcaaagacaccaacggctcccagtt^^ 



200984_s_at HG-U133A 
5 ttgaactagattgcatgcttcctccmgctc^tnggaagaccagctttgcagtgacagcttgagtgggttctctgcag 
cagttagcatcattagtaratctttggagggtggggcaggagtatatgagca^ 
atgccaaatgttccatatgtgggtgtcagtcagggacaaraagatccttaatgcagagrt^ 
catgaaaacttagagaggtacaagtggctgaaaatcgagtttttcctctgtctttaaattttatatgggctttgttatcttccactgg 

10 200985_s_atHG-U133A 

ggtaggatatcttggctttgccacacac^gttacagagtgaacactctactacatgtgactggcagtattaagtgtgcttattttaaatgttac^ 
ggtatgtgtgtatatagtatgaatgcagtggggacacxctttgtggttacagtttg^ 

tgcatccagcctcctgctaactcctagctgactc^gi^tagattgtataaaataccmgtaacggctcttagcacactcacaga 
aaaatgatacacacctttcacaagggcaaactttttccttttccctgtgtattctagtgaatgaatctcaagattcagtagacctaa^ 
1 5 tttaatggcataggctgacttttgcagat 

200986_at HG-U133A 

gacgacc^gccaggatatgctctcaatcatggagaaattggaattcttcgatttttcttatgaccttaacctgtgtgggctgacagagga 
atgcagcaccagacagtgctggaactgacagagactggggtggaggcggctgcag 
20 ccttcctcttcgtgctctgggaccagcagracaagttcc^ 
cctctccagcctcagctctcagttgcagccctgc^ 

gggcaagggacctgcttctattagcccttctccatggccctgccatgctctccaaaccactttttgcagctttctc 
200996_atHG-U133A 

25 aagtgtatcaccatgcagatgtagaagagagcgaaagtgattgtgtttttctttagattgaatatttgaatcttatgtgtaacaaaaagaagtgggttttagtt 
ctgatattttgtatattaatgaattatccaagattcgatgggatttatcagtgtgtagatagctctataatgcttgaattgtac^cttctaagtgtgc^ 
atttcatactttttatactttgaggaaaaaaagtcaaagaaaaattgtatttgagggaaaaaaccatg^ 
tggcatacacac^cgtgggattccagttattatggagtgcttccatccctctccaccccttccccccaaaaggttttctttgca 

30 200998_s_atHG-U133A 

tgaaagcctcagtcagcraagtggaggcggacttgaaaatgctcaggactgctgtgga^ 

ggaatcagccaagggtttactagatgacctgaggaatgatctggataggttgtttgtgaaagtgga 

gcggatttaaagtccaatttctcatgacraaaaaatgtgtggtttmc^ 

caaggc^ttgtatmcatgcccagttaacttatttacaatamaagttctrt^ 
35 atgtctctagt 

201004 at HG-U133A 

agaggcgatggcggcgatggcatctctcggcgccctggcgtf^ 
cctactacacracttctgac^ctgtcatttcxactgagaccgtc^^ 
40 gtggaaaacaattccctgtcactcgaggccaggatgtggggcgttatcagg^ 
cttcgacgaggagtcctacagcctcctcaggaaggctragagg 
gaacgggccctgggtgtccatfgaggtgctggctgcggcgata^ 

201005_at HG-U133A 
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201008_s_at HG-U133A 
5 gcaaaggggWcctcgaWggagctttttttttcttcxaccgtcatttctaactcttaaaaccaactcagttccatcatggtgatgtt^ 

ctttaacgaccctgaaaaggtgtacggcagtggcgagaaggtggctggccgggtgatagtggaggtgtgtgaagttactcgtgtcaaagccgttaggatcctggcttg 



10 

201012_at HG-U133A 



15 

201015_s_at HG-U133A 
agcttcagactcaagtaccxattctgttttcccccagra 

gtcti 



20 tcactccccagggagctccggggacagcttcctcacacccctgtccra^^ 

gtgaggggctttctcttccccttcrtgtttcagacccccccatttcccgcacatggtgtggggggctgggggaggtccaagcagagtgttttattatt^ 

ttat 

201029_s_at HG-U133A 
25 agcccaggcgttggcagcagggttagaacagctgcctgaggctcctccctgaa 
ttttaatcttgcgatgtgctttgcttgttgctgggcggatgatgtttac^ 
gggcggtttcccatgggatgtgaaaggctggccattattaagtccctgtaactcaaatgto^ 

aatcacgtgtccatcgagcacgtctgaaacc 
30 201034_atHG-U133A 



agtgtgttttggagctggctctgtttgtgtgcatatgataatgcagagttgagc^^ 
tggtggaaactttcagttgcttaactctctattggaagattttMaatgttcte^^ 
tttttaaaaatatatatatctaaatgaatgcaatgtgcataaatattttttaaacataa^^ 
35 agccaatgaacctctgtgtcctgtggaa 

201061_s_at HG-U133A 

tatagctctctttagctc^accactctgtcc^tccagccaatggatgtccttccctgtacccaattcaagcttatt^^ 

ctgagttattgaggattgagccagtgcaacgttaaactragtgcacttacatttgamaaatgatggtmatctgttgtgtgaagtggttcacccttgaggaccaggag 
40 ccatatccti 



201069_atHG-U133A 
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ctcagagccacccctaaagagatcctttgatatWcaacgcagccc^ 
agaacccttggagccaatggagactgtctcaagagggcactggtggc^^ 
cccctggcttttractgctggctgccttagaaccm^ 

gaaggactcaggttgtctgaagtcactgcac^gtgcatctcagcccacatagtga^^ 
5 ggaggaaaaccaagccgtggcttcccgctcagccctccctgcccctcccttcaaccattccccatgggaaat 

201079_at HG-U133A 

cccgcctgcagtggctagaagccagcaggtgcccatgtgctart^ 

ctgccaaagactcgtgggggccatcacacctgccctgtgcagcggagccggaccaggctcttgtgtcctcactcaggtttgcttcccct^ 
10 gggggccaccaccctgtgccggtggcctctgggctgcctcc^ 

agggctttgcctgacaacacccagctttatgtaaatattctgcagttgttacttaggaagcct 

201105_at HG-U133A 

ctcctggactcaatcatggcttgtggtctggtcgccagcaacctgaatctcaaacctggagagtgccttcgagtgcgaggcgaggtggctcctga 
1 5 gtgctgaacctgggcaaagacagcaaraacctgtgcctgcactt^ 

cggggcctgggggaccgagcagcgggaggctgtctttcccttcc^gcctggaagtgttgcagaggtgtgcatcaccttcgaccaggccaacctg 
ccagatggatacgaattcaagttccccaaccgcctcaacctggaggccatcaactacatggcagctgacggtgacttcaa 

201 1 25_s_at HG-U133A 

20 agcccacagtgcctgtacaggaaggtgcctggccatgtcacctggctgctaggccagagccatgccaggctgcgtccctccgagcttgggataaagcaaggggac 
cttggcgctctcagctttccctgccacatccagcttgttgtcccaatgaaatactgagatgctgggctgtctctcccttccaggaatg^ 
aagaagactgtcaggaagggtcggagtctgtaaaaccagcatacagtttggcttttttcacattgatcattmatatgaaataaaaagatcctgc^ 
agtcctgagacttttctgcgtgatggctatgccttgcacacaggtgtt^^ 
cgtgtttgtttagtac 

25 

201 1 37_s_at HG-U1 33A 

ggcagcattraagtccgatggttcctgaatggaraggaggaaacagctgg 
gctggaaatgaccrcccagcagggagacgtctacatctgcraagtggagra^ 

cagagtaagacattgacgggagctgggggcttcgtgctggggctcatcatctgtggagtgggactcttcatgcacaggaggagcaagaaagttcaacgaggatctg 
30 cataaacagggttcctgacctcaccgaaaagactaatgtgccttagaacaagcatttgctgtgttttgttaacacctggttccaggacagaccctcagcttcc^ 
gatactgctgccaagaagttgctctgaagtcagtttctatcgttctgctctttgattcaaa 

201152_j>_at HG-U133A 

gaaatccctgagtatcaggccttgttataaataagctgcataatcaataaatagaara^ 
35 gcgtgtgtgcaaactcttgagggttgattatgctgcaamagcatgttggaacgtctagggagaaggttgactttttgcacttctgtatate 

ataatagtaagatcttatmgaataaaaacgtctataattacaaggagtmgttaaggctaatacaatgacagactgagcaaaattgcttgcaaaagtggca^ 

agcactccataccxxttcaaac^tgttgcWgctttcttgtggacagcttgtagmgccaggattttttcagct^ 

aaaactccatggggccaaatctgcctgaagatcattaccaaaaatagcaggtacttctaccattaaggtg 

40 201153_s_at HG-U133A 

ttcagatragtttctgccaafficagtggtttattgttcacaaaaaaa^ 
ctgtagctatagttgttatgtgattgtttmaattgctgtaggat<^ 

tcaaaaatgcaaaatgtgataatggg^cttgtttaaaagaattagtgtatccagccttcactccagctggttaaaaatgttgcactt^ 
tgctgaaaggaraaatgtgcttggttttacrtattatgtaatcacaa^ 
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201162_at HG-U133A 

ggaatgtcactggtgcccaggtgtacttgagctgtgaggtcatcggaatcccgacacctgtcctcatctggaacaaggtaaaaaggggtc^ctatggagttcaaag^ 
acagaactcctgcrtggtgaccgggacaacctggccattcagacccggggtggcccagaaaagcatgaagtaactggctgggtgctggtatctcctctaagta 
5 aagatgctggagaatatgagtgccatgcatccaattcccaaggacaggcttcagcatcagcaaaaattacagtggttgatgccttacatg 




10 

20 1 1 64_s_at HG-U 1 33 A 

cagtatgccaactacgtggtccagaagatgattgacgtggcggagccaggccagcggaagatcgtcatgcataagatccggcccracatcgcaactcttcgta 

acacctatggcaagcacattdggccaagctggagaagtactacatgaagaacggtgttgacttagggcccatctgtggcccccctaatggtatcatctgaggca^ 

tcacccgctgttccctcattcccgctgacctcactggccc^ctggcaaatccaaccagcaaccagaaatgttctagtgtagagtctgagacgggcaagtggU^ 

15 ggattactccctcctccaaaaaaggaatraaatccacgagtggaaaag 

gttttactggtgtataggatacttgtacataggtaaccaatgtacatgggaggccacatattttgttcactg 

201 1 89_s_at HG-U133A 

acagtcctgcttagagcccttaaaaagacttgaaagttcactgggactcagtttaccttaatgccttagcagaagataaatcctacctagagacctttgt^ 

20 ataactgaraactctttgtagtcctccttgtgggtagttaagagtggg 

tgctgttactagtgattttagggcmgttatttaacttamcaagggtgctg^ 

agacctcactgtcgtgtccatgcxaccxccggcatgg^^ 
gaagagttgtacgcccaggcaacaagggctgagctgcgcttgcgtggctgtttcatgaccgc 

25 201200_at HG-U133A 

catgtgcacacagattatmttggctccaaaactggattgcaaaagaaagaggagaagaatattttgtgtgttcctggtattcttttataagtaaagtttac^ 
accagcttcagccagggacaaaatcccctcccaaaccactc^ 

aatagcttctagtgctgctacaatctaagtcaggtcaccagagggaagagaacatggcattaaaagaatcacatcttcagaagagaagacactaatattattaccca^ 
atacatgatttcagaagatgacataagattcctcttaaagaggaaatgtcaggaatcaagccactgaatccttaaagagaaaagttgaatatgagtcattgtgtctgaa 
30 aactgcaaagtgaacttaactgagatccagcaaacaggttctgtttaagaaaaataatttatactaaatttagtaaaatggacttcttattcaaagcat 

201247_atHG-U133A 

gagtgggc^gagggttgcctatggtgggcactaggaatgaggtcccctgcctcgatgcgggtcctaggagaaaaagtcctactmctgggtccccaggtgcagca^ 

ctcccggagactgtttctcxratggcctcctgagtgatgg 
35 ctggggggtgctcacagcagggccaccttgatgraggctggaatgttatccc^ 
aatttatggaccctgcccgcctgcgtgttgtgtgtatg^^ 
tcttgatg 

201263_atHG-U133A 
40 gatctggatccaggctgtacattgaataaaaagattcgaaatgcaragttagracag^^ 

atcxgcaraagagacaataaggtcrac^gggaacgcaccatttctgaaa^ 

gaattttaatgaaaaaattacccagattggct(xatggaaaaggaggaacagcgtttccgtaaaattgactttgtactcgaaaacgtcaatttatattgaactt 
ggagtttggcaaagtctgaaataggtcaacctgcaggcgtaactattt 
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201278_at HG-U133A 

ggaaacgttccc^gttcamtcagtcctgttgtgagcacagttctgaagggWattattgtcaaaataagtmgttttgttttgtttatgt 
aatgctc^ggttcttgtgggagttaatc^gcc^c^tcc^ 
ctctattgattagccccaagtagaaagicctgtggttttatgtttaatggtaatagttgat^ 
5 gaaatagtactttaaatgtccaaatacctaaatgtgctaaactggaggtaactatttcte^ 

201281_atHG-U133A 

agagcatcctggccacgatgaacgtaccagccgggcragcaggcggcra^ 
caacgcggatgtccaggagcgcctgcttccctacttgccatctggggagtc^ 
10 caggccctgggcatgttc^gcgcagccttggcctcggggcagctgggccccctcatgtgccagttcggtctgcctgcagaggctgtgga 

gatgtggaagcgtttgccaaagccatgcagaacaacgccaagcccgagcagaaagagggcgacacgaaggacaagaaggacgaagaggaggacatgagc 
ctggactgagccacgcgccgtcctccgaggaactgggcgcttgcagtg 

201324_at HG-U133A 
15 caccaaattacctaggctgaggttagagagattggccagraaaaactgtgg 

caaaaaacatacttacamcagacatatccaaagggaatactcacattttgttaagaagttgaactatgactggagtaaaccatgtattcccttatctttt^ 
acatttatgtctcatgtaatttgcattactctggtggattgttctagtactgtattgggcttcttcgttaat 

201334_s_at HG-U133A 
20 gatatttccaccacttttcagagtcatctacaaaatmtctttcrt 

gtaggaaaaaaaagtacctcctagaaggaagccttccccaccatttccag 

gttatmtaataagcctcttcctactagaacattttattttccttgttcaccatacaatcatgtactctttaacagaaattgctt^^ 

tttattaataatcatgtatmtactgatcacattttgaaatgcctaaaaga 

gtatgtttctggaacaaaatattggtcatctaaaaactttctgtt 

25 

201338_x_at HG-U133A 

gatgtatgtcgctgtccaagagaaggctgtggaagaacctatacaactgtg^ 
catgctggctgtggcaaaacatttgcaatgaaacaaagtctractaggcatg^^ 
30 gaaaaacgggagtttggcctctcatctcagtggatatatcctcccaaaaggaaacaa^^ 
agacaagatgctctcgacagttgcagtacttacccttggctaagaactgra^ 

201352_atHG-U133A 

gaaatgctgtgtgcattttcatgttctttmttagcattactgtcactctccctaatgaaatgtacttcagagaagcagtatt^^ 
35 tgtccctrattttgactataactgtgcttggtttcaaaagcaaaatte^ 

gtttacttttcattcagatmgtgaatatgaacatgctgttataggatctacaga 
attatattcttgacgttgacttcattaaa 

201358_s_at HG-U133A 
40 gtttgtcttttgtcttaacagttctgaatgctgtcctc^aagtatataatgtttcatgtacc^agacx:cttttcacagtacaataaaca 

201359_at HG-U133A 

aaatgactacttacagcacatattaaagtcaaccaatatgaaatgcctgactccagaaaagg 
atafflggtgaagatgcacttgcaaatgtcagrattgagaagccaattracragggac^ 
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aatggccttaagtcttggagataaaatcaacttgtcaragaagaaaactagtatataa 
tggactccaacatcttttgtactctttcatgcttatatagaatctgagttcatgctgaatacttttcagccaataatttat 



201360_atHG-U133A 

5 ccccatggacgccagcgtggaggaggagggtgtgcggcgtgcactggactttgccgtcggcgagtacaacaaagccagcaacgacatgtaccacagccgcg 
ctgcaggtggtgcgcgcc»gcaagcagatcgtagctggggtgaactacttcttggacgtggagctgggcc^ 
aactgccccttccatgaccagccacatctgaaaaggaaa 
aggacgcctaggggtctgtaccgggctggcctgtgcctata^ 
gtctccccatgtgcctgcaccaggagacagacagagaaggcagcaggcggcctttgtt 

10 

201362_at HG-U133A 

gcaggaaatccttgcaccatgggattaatatccaattgctgcttgtacactcattcattactaaaagttttgagaaatttttttttccagtaatgag 
aataactcacctggcatcttaratctgaaataaggaatgatataaggtttttt^^^^ 

ctattctgagatgactattcagtggctaacttgggttaggaagaaaataattaggtattttctccaaatgttcactggtactctgcc^cm^ 
15 aaccaccaggaaagcaaatcagtttggttggtaactctgtaattcctaactatcactggtttggttctggactaaaactacattgaca^ 
tgtttttaatatggatctgtatgtgttctattcagcccaagga 

201377_at HG-U133A 

cccccttacaagcatttctggacggctgagagctaatttggcccaaggrt^ 
20 actgaggagagcctgctgagctgaggccatggcctgcgtggcttggggaaatgagttggtggataccttctgggcttttgaacttgcccctcc^ 

atgtgtctgaccctgtcttacccamcaagttcaagcggtgcag^ 
201396_s_at HG-U133A 

tcctcgcgactgagagccaggggcgacatcatgaccttctgtcccggccgccttagccccgggcacagggaaggcagctgggccgtttctgtctgt^ 

25 tgtccttctgtcctggatgtttcatgggc^ 

ctaaaacagccaccaggaaagc^gccggagagccggac^gcaggcagcc^gtctgggttcctgaggcctgggggtggcagacgagcccacg 

ccagcagcagggttgtcagtcggagcatcctggggctccctggctcctggccgtctgtgaggtaggcgcagtaccgtgtatcgtaggtagcagtaggaacgggg 

cgccgcggccctgcagccgctcatggcggtgaggtgtgtgccaagcccacccggggtgcagggcgtgacgtgtggggaat 

30 201416_atHG-U133A 

gaggtcagatttggagcttctcattgcacgcggagattattattgc^tcgggttccaagccaatgggaagcccgggggaggggtttggcatgaggaag 

agcagctgattggctgcagccaagactgtgaaaggataaagaggcgcgaggcggaattggggtctgctctaagctgcagcaagagaaactgtgtgtgaggggaa 

gaggrctgtttcgctgtcgggtctctagttcttgcacgc^^ 

acatttattcatgccagtctgttgcatgcaggctttnnggcttcctacxttgcaacaaaataattgcaccaactcctt 
35 cacagtcmtttgctttgcattgtaggagagggactaagtgctagagactatgtcgctttcctgagctaccgagagcgctcgtgaactgga 

20141 7_at HG-U133A 

gtaaaccacatcttttttgcacttmttataagcaaaaacgtgccgtttaaaccactggatctatctaaatgccgatttgagttcgcgacactat^ 
atttgactamaatcctttctacttgtcgctaaatataattgmtagtcttatggcatgatgatagcatatgtgttcaggtttatag 
40 aaacatcmgtacatttaagtctgtattataataagcaaaaagattgtgtgte^^ 
agggtttttgtttttaaacttttttttgccatccatcctgtgcaatatgccgtgta 

201425 atHG-U133A 
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ggcactgtgtgggtcaactgctatgatgtgtttggagcccagt^ 
tacactgaagtgaaaactgtcacagtcaaagtgcctcagaagaact^ 
caaaaccaagaaaaatgatccttgc^tgctgaatatctgaaaaga 
ggttggctgagggtaagagtatatgaggaaccttttaaacgacaacaatactgctagctt^^ 
5 gcttcctataactc 

201433_s_at HG-U133A 
ttcaggccactmctgtcttttamggttactgtt^^^ 

cttctagagcagaaaagggagagaagtgttgtattcctgtttggtaacctcagtctcctgtaagacctcctaccacatggcgagtatacaccaatcagga^ 
10 ctgcctgcataggagcctcgcttcc^attattcccttcccaatattattcatccagacttagcc^cagtgcacaaaag 
agcttctccccagcttggtgrcttttacate^ 
atgctctagaaccttgaccttgat 

201437_s_at HG-U133A 
15 ttWttcctcatgtaacattggtgaaggatccagga 

acaaatttatgcatttcattcaaatragtgatctatgtttgtg^ 
ctgtmtgtatctttatgctgtattttaacactttgtat^^^ 

tgtaaataaaatgtgtacagtgaattgtcttttagacaactagamgtccttttatttctccatctttatagaaggaattt^ 
cttttgtggtgtcttccatgtgaacagcataagtttggagcactag 

20 

201462_atHG-U133A 

atatgcctgttagaccttagctgtggggttcctttactatt^^ 

cattggtacaggtgttattcagtactgggtggttcagcagggaagc^ 

catttattggggaaggtcttcagttgagctGtcagccttaggaaggaagcacgtggaggagggacggaggaggttcccttgctgggca 
25 agcagcaggtc^tgtgcacatgccgttgcagcacaagcttatgcttcccgtagccgtggcttttcattctgcacagtcccaggtc 
aatgtgctttatctgattgactccaaacatcccgaaatgtcacctgcagatttctcgtgggaa 

201463_s_at HG-U133A 
caaaacrattgtcatgggc^cctccttccgcaacacgggcga 
30 aggacaacgccaagctggtgcctgtgctctcagccaaggcggc^ 
aggaccagatggctgtggagaagctctctgacgggatccgraagtttgrc^ 
atggaaagtagc^catccctgaggctggactccagatctgcaccgccggc^ 

201482_at HG-U133A 
35 tggaatggaactcctcartagctgctggggctccgcccaccdgc^^ 
atgcctggccagcdccagctcctcagacctcctgggtgg 
ggttgctgggctgtagggtgagtggcttgcttggtgggacctgacgag 
atggctgctttcacttccxcccattgagctctgctccctctgagcctggtcttttgtccmttt 

ggg 

40 



201485_s_at HG-U133A 
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ctajgagcctgtctcagtagagtactggctcctttlataatttgttacra 
tct 

201486_atHG-U133A 
5 gatatmtccaaacgtattgagcaacaaaatattaatattgtgccatatgacaacaaa 

(xagacttgaaaacatggaggamtagagatgcctgaacaatattatttaagtagtatgtgacxgagctataaattttttgtttttgttctaagtagatttaatttgggaactga 

caggacaatgtttttaggtttagcattttgtttaaaaacctttaaagaaacctttagaaggacttagacctcacatattaatgttgagaagttctgcttaatttt^^ 
ataaagggttttattgtatgaaatagaactttatatttttgc^tatgtatagaggataattatatttaatgtataactatagcattatggtgagtggaamgacattgtc^ 

tttttcatt 

10 

201496_x_at HG-U133A 

tcaagtccaagttcaagtccaccatcgcggcgctggaggccaagattgcacagctggaggagcaggtcgagcaggaggccagagagaaacaggcggccacc 
aagtcgctgaagcagaaagacaagaagctgaaggaaatcttgctgcaggtggaggacgagcgcaagatggccgagcagtacaaggagcaggcagagaaag 
gcaatgccagggtcaagcagctcaagaggcagctggaggaggcagaggaggagtcccagcgcatcaacgccaaccgcaggaagctgcagcgggagnngg 
15 atgaggccacggagagcaacgaggccatgggccgcgaggtgaacgcactcaagagcaagctcagagggccccccccacaggaaacttcgcagtgatgcacc 
aggcgaggaaacgagacctctttcgttccttctagaaggtctggaggacgtagagttattgaaaatgcagatggttctgaggaggaaacggacactcgagacgcag 
acttcaatggaaccaaggccagtgaataagcaactttctacagttttgcaccacgg 



201497_x_at HG-U133A 

20 tgagccaggaagaacgctccagcccaggacttcgaggctgcaatgagctataattgcatcattgcactccagcctgggcaacagagaccctgtctcaaccaccac 
caccaccaccacccctactacccx*gtattraaggtaa 

ctatcaatggattgtgcagacactgagtttttagaaaaacatatoacggtaaccggtccctggcaattctgtttacatgaaatggggagaaagtcaccgaaatgggtg 
cc^ccggcccccactcccaattcattccctaacctgcaa^ 

acagttcaccaagtgccttctgtagtcacatgaattgaaaaggagacgctgctcccacggaggggagcaggaatgctgcactgtttacaccctgactgtgGttaaaa 
201506_at HG-U133A 

acaggaggaatgcaccacggcagctctogccaatttctctca^^ 
ccataatgagatgtgagccttgtgcatgtgggggaggagggagagagatgtactttttaaa 

gatgtcactgcctgacattcacttccagagaggacrtatcccaaatgtggaattgactgcctatgccaagtccctggaaaaggagcttcagtattgtggggctcate 
30 acatgaatcaagcaatccagcctcatgggaagtcctggcacagtttttgtaaagcccttgcacagctggagaaatggcatcattataagctatga 
(^aatgtgtctracatctaca^tggcttggaggcttttatggggccctgtccaggtagaaaagaaatggtatgtagagcttagatttccctatt 

201 530_x_at HG-U133A 

gatgcatgctc^agatttcactgtatc^ccatgcatggagatatggaccaa^ 
35 ccactgacctgctggccagaggcattgatgtgcagcaggtttcttta^^ 

gacggtttggccgtaaaggtgtggctattaacatggtgacagaagaagaraagaggac^^ 

ctcaatgttgctgacctcatctgaggggctgtcctgccacccagccccagccagggrtraatctctgggggctgaggagcagcaggaggggggagggaa 
ccaagggatggacatcttgtcattttttttcttt 

40 201537_s_atHG-U133A 

grtgctctggaataWcccttcgatcttatctcagtc^cttcgWttgagaagagtgatgccttgggtatgcttttWttttcttW 

tttaaaaaccccaccamggagaattaraagggttttgtcctgaattgtagtgtt^^ 

gaaggaggaagttggccaattaragcgtgtgtgcatggatgtgtgtggggggcgtgcctdcagaaacgcggcx^gaagacaagcagggaagtgaaaggtccc 
aggcacaraccrtgccrattgcaggtggctcttacagctct^^ 
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aaatggatccmattggtggtagagcaaaaaaacccaaacacgataaaccm^aaagacmrt^ 
catt 



201540_at HG-U133A 
5 cagggctgtcatcaacatggatatgacatttcaraacagtgactagttgaatcc^ 
ttctgttgtgattcccaggartmcctcaagaggaaatctggamc 

taatgagacaatatraaaagtaaacatgtaatgacaatacatactaacattcttgtaggagtggtt^ 
tctaacgtttcagtg 

10 201548_s_atHG-U133A 

tacccccctacttaatgtaattcaggactccaaccaagaggatttctta^ 

ccttatgggatcatttttccaggactctttgaagaaaagaaaaaacaactaaaaaaatt^^ 

gctgactgtttfflttgc^ggggttcccacc^atttggaaggcattgaag^ 

aactcacttcc^gtgcccaaaagggcacccaccagaaattccagtaaatcctc^tttgaggaagctctcccttgtttactctgttaccaca 
15 cactttgggagtttttgtttgtttcttcttttccmatcca^ 

201564_s_at HG-U133A 

ccctggagcggcagggcgtgacggccaragggtctgcccgctgcacgttrt^ 

tttcctttcaccctagcctgactggaagcagaaaatgaccaaatcagtatttttmaatgaaatattattgctggaggcgtcccaggcaagcctgg 
20 atctggcggggggcgtctcagcaccctccccagggggtgcatct^ 
ccagagcccctgctgtgattggtgctc 

201585_s_at HG-U133A 

gac^tgcgtaGtgagcgctttgggcagggaggtgcggggcctgtgggtggacagggtcctagaggaatggggcctggaactccagcaggatatggtagagggag 
25 agaagagtacgaaggcccaaacaaaaaaccccgattttagatgtgataWaggcmcatta^ 

aagaaagctatctttttatggatgttagcagtttattgacctaatatttgtaaatggtctgtttgggcaggtaaaattatgtaatgcagtgmgg 
tttatttctttattttttcttttttactgtataatgtccctcaagtttatggcagtgtaccttgtgccactgaatttcc 

201592_at HG-U133A 

30 tcagccttgccagcagcaatcatttggggaagaatctacagttgctgatggacagagtggatgaaatgagccaagatatagttaaatac^acacat 
actagtaaacaacagcagragaaacatcagtatcagcagc^tcgccagcaggagaatatgc^ 
cctgtccaaactcttcaaaccaccaragccgcct^^ 

aaaacttaggc^agctcttc^tggccc^ggctcttcaagaatacaacaactaagaaaaggaagtttccagaaaagaagttaacatgaactctt 
ggcaactcttggaagaaatataWgcatattgaaaagcacagaggatttctttagtgtcattgccgattttggctataacagtgtctttctagccataat 

35 

201 595_s_at HG-U133A 
ctttccacctagaatraaraggatgtttatttcctatgctgattc^ 

agtaagttcagagtagttcatgataaattgaaaatataatggtcattgragaaaatgattgatgttgte^ 
ttcatttgggcatgtgctattaccagaaaraacaaacttatatttaaaatacc^ 
40 cctaaaacatgagractgtacttcataaaggaaactgcgtatgcagattcagtattgtg^ 
atcagctacaatgccctgtgttaaattg 

201596_x__at HG-U133A 
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gagagcaccacagtggtcaccacacagtctgctgaggttggagrt^ 
tccatgagaaatctgaaggccagcttggagaacagcctgagggaggtggaggc^ 

gtcagagctggcacagacccgggcagagggacagcgccaggcccaggagtatgaggccctgctgaacatcaaggtcaagctggaggctgagatcgccaccta 
ccgccgcctgctggaagatggcgaggactttaatcttggtgatgccttggara^ 
5 gg 

201 602_j5_at HG-U 1 33A 

atcagagaacgacgacgaccaagagagaaaagaagatctacaggagtttcattttggacacaagatagtgatgaaaatgaacaagaacaacaatcagacaca 
gaagagggatccaataagaaagaaactcagacggattccatttctagatatgaaacca 
10 atacagttacttagaagaaagaaaaccttacagcagcaggctagaaaaggatgactcaactgactttaaaaagctttatgaa^aattctag 

ctgaaggcacagctacatgatacaaatatggaactaacagatcttaaattacagttggaaaaggccacccagagacaagaaagatttgctgatagatcactgttgg 

aaatggaaaaaagggaacgaagagctctagaaagaagaatatctgaaatggaagaagag^ 

aaggatgaaaatggggccttgatcagagttata 

15 201640_x_at HG-U133A 

ccaccagggaggaggcctccacgtccctgcccacc^agcccacccagggggccagctctgccagcgagccccaggaagcccctccaaagccagcagagga 

caagaaaaaggattagtcgagactggtcctcacctgctccggcta 
cagatcaggccggggcggtgggaggcccgcctcaggtcagggcccagcg^ 

tctgtccctctgtgtttccagccatctcgccctgccagcccagc^ccactgggaatcatggtgaagctgatgcagcgttgccgag^ 
2 0 gccgggcccccctacgggatgcccacggc^ ■* 

201644_at HG-U133A 

ctggcccagctctttatctgggtcctgcgggagtacaatgaagt^^ 

cggtggtggaggccatggacttccatggggaagtcacctttgatacaaccaagtcggatgggcagtttaagaagacagccagtaacagcaagctgaggacctacc 
25 tgcccgacttccggttcacacccttcaagcaggcggtgaaggagacctgtgcttggttcactgacaactacgagcaggcccggaagtgaagct^ ^ 
caggtgccagcggaccatcggctggcagag 4 '4 

■i 

201652_atHG-U133A 

tacccaaagggctacaaacctcctgatgaaggaccttctgagtaccagac^ 
30 agtctcatatttc^aatcctcmggatcgcaaattgcttgagctgttgtggaataaatactgggtgaatacgttgagttcttctagctt^ 
gtraggtcmgatttgtctgaaaagttagagcagtcagaagcccagctgggacga 
cttgccaaagctacaagagacagctgtaaaactacratagaagctatcc^^ 
gtctctgagaagtactttacctgaaagacagtatgagaaaaatattc^^ 
ctaaatat 

35 

201 753_s_at HG-U 1 33A 
ataattcaccataaaragctatctgtctgaattactt^ 

acttgataat(^tattataattttttatcatgattattgactatatmtggagtcccattgtttcagtgggcattaacagaatgctttaaa 
ttagtatacactggcacataattttttaaaaagttttaagaaaagattcatttg 

40 

201761_at HG-U133A 
taatcattttcctaagctgtccrtttgaggcttagt^ 

agttttctaggactgaaaggattctmctacattatacatgtgtgttgtcataWggcWtgctatatacmaacttcattgttaa 



1350 



BNSDOCID: <WO 03039443A2_I_> 



WO 03/039443 PCT/EP02/1 2303 

aacctatttttgaaaaacaaacttggcttgataatcatttgggcagrt^ \ 
atgtatcctgttgacttttcca 

201810_s_at HG-U133A 
5 agtaagagccaaagcagcacctccrctgagggccaggccttggagaaccggatgaagc^^ 
cataaaaatggtgcagattggctgattcatcctgggccctggccgatgtgcatatca 
naaatcttacacgtctactctaaactgctctaatgaagtttcagt 
ttgtgggtagactgtgactaggttcacagcctttgtggaacattccgtataacg^ 
tgggaagccaggccaaagtgggggcaacagcttgcttctcWctcttctcac^ 

10 

20181 1_x_at HG-U133A 

gaatgccagtggggatgcggggggatgagggtaggagggacttatagaaggggamgtggctgtgggggagaaggttctacagc^taagccttatcctgcca 
aaggggatttattctaagagaagtgratgtgaagaatggttgccactgttattagattgacaag 

agggttatgctgcactagtattccttagaggaaacagttctttaaagttaggaaagggagtaggcaggcatgtgttggcaaaggctgttaatagtagttaagtgttaaga 
1 5 ctgcttttctttaacgtmcatggtaatgcatatttagagcac*^ 

tcacagcctgtatattttagtcatttgtaaatctcttcatacaatagtgacttcttttttgactgatacag 

201812_s_at HG-U133A 

ccgtcgccatggtgaagctgagcaaagaggccaagcagagactacagcagctcttcaagggg 
20 gggatttaagaggggtgcagatcccggaatgcctgaaccaactgttttgagcctactttggggataaag 

catggaagatgtgtgctctggctcggataagagatgggacatc^ttcagtcactagttggatggcacaaggctcttcac^gacgc^tct^ 

201813_s_at HG-U133A 

cccc^aggccagactgaggctcatccagtggagaaccttcttaaacc^ctgcttccttcccggcatgcatttggc^ttggtcca 
25 gratatatggccacaaagcacagaggcttaggtttgccacatgcagacagggctttctggg 
cgtacccagtatcacactracagttagaaaagaccgaatcacaattta^ 
cttagaaggagragaaatagttattttcgtatcttctatccctcaaagcatca 
actttattcctgcattg 

30 201825_s_at HG-U133A 

aggcaacttctcataaaattcccatggttcttctccW 

aggatacagccaaggcactggtacagataagaacaaacraaatatc^aaatttgtactcaggtgaaaggacra 
ggttcaggcagccatgactcttctaagtgatgcttctcatctgcrt^ 
acaaacacggtattgagtttagtgttattagcagctctgaagtctaaacactggaagaatt^^ 
35 attcttctgtaagcctgtctga 

201 828_x_at HG-U133A 
rcctoxatgtacatttgga^ctgtcctg^ 
accagactgccagacgactaratcattctgcccacagacctgcgctgcra 
40 tcacctcggaggtatctgggctggccacagtccctggacagtgat^ 
cccagaaacctcttgtgttcttgc^aggccraggtgttcctggra^ 
caactgtctctctgatgtgtgccttttgttcaacacagtaacccctgcattctg 

201829_at HG-U133A 

1351 
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